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gggcggtgtc cggccgttcg agcgcgtgga cgccaac atg age gc 

Met Ser Al 
1 

ggc etc ctg acg ctg gcg cct gcg 
Gly Leu Leu Thr Leu Ala Pro Ala 



gcc aag acc 
Ala Lys Thr 



ttc etc gtg 
Phe Leu Val 

25 

acg cga ctt 
Thr Arg Leu 
40 

gtc gcg cga 
Val Ala Arg 



gcc gtg 
Ala Val 
10 

act gtc 
Thr Val 

ggc gtc 
Gly Val 

etc acg 
Leu Thr 



ctg ggc acg 
Leu Gly Thr 
30 

ccg aaa age 
Pro Lys Ser 

45 

etc tgg ggg 
Leu Trp Gly 



acg ctg 
Thr Leu 
15 

tac ggg 
Tyr Gly 

ttc gtg 
Phe Val 

ctt ggg 
Leu Gly 



g tgg acg agg 
a Trp Thr Arg 
5 

egg ata gtg 
Arg He Val 
20 

etc acg gtc gcg gcc tgc 
Leu Thr Val Ala Ala Cys 

' ^ 35 ■ 
ctg ggc ctg 
Leu Gly Leu 
50 

tac cac 



ttc 



Phe Tyr His 



acg egg tgc 
Thr Arg Cys 

a^tt gag gtc 
lie Glu Val 



55 



103 



151 



199 



247 



2 



55 60 65 70 

tct tgc gac gcc caa ggc ctt egg gag tgg ccg cgc gtg att gtc gcg 295 

Ser Cys Asp Ala Gin Gly Leu Arg Glu Trp Pro Arg Val lie Val Ala 

75 80 85 

aac cac gtc teg tae etg gag ate ttg tae tte atg teg aee gtg eae 343 
Asn His Val Ser Tyr Leu Glu lie Leu Tyr Phe Met Ser Thr Val His 

90 95 100 

tge eeg tct tte gtc atg aag aag ace tgc etc ega gtc ccg ctt gtc 391 
Cys Pro Ser Phe Val Met Lys Lys Thr Cys Leu Arg Val Pro Leu Val 

105 110 115 

ggc tac att gcc atg gag ctg ggc ggt gtg att gtg gac cgc gag ggc 439 
Gly Tyr He Ala Met Glu Leu Gly Gly Val He Val Asp Arg Glu. Gly 

120 125 130 

ggc ggt caa age gca teg gcg ate att cgc gac cgc gtg cag gag ect 487 
Gly Gly Gin Ser Ala Ser Ala He He Arg Asp Arg Val Gin Glu Pro 
135 140 145 150 

ect ega gat teg teg age gag aag eae cac gcg eag eeg ctt ctt gtg 535 
Pro Arg Asp Ser Ser Ser Glu Lys His His Ala Gin Pro Leu Leu Val 

155 160 165 

tte ece gag ggg aee aee aee aat gga age tge etg etc eaa tte aag 583 
Phe Pro Glu Gly Thr Thr Thr Asn Gly Ser Cys Leu Leu Gin Phe Lys 

170 175 180 

acg gga gee ttt egt ect ggg get ccg gtg ctt eeg gte gtg ctt gag 631 
Thr Gly Ala Phe Arg Pro Gly Ala Pro Val Leu Pro Val Val Leu Glu 

185 190 195 

ttt ccg att gac aaa gcg cgt ggt gac ttt tec ccg gcg tac gaa teg 679 
Phe Pro He Asp Lys Ala Arg Gly Asp Phe Ser Pro Ala Tyr Glu Ser 

200 205 210 

gtc cac acg cea get cac etc ctt cgc atg etc gca caa tgg agg cac 727 
Val His Thr Pro Ala His Leu Leu Arg Met Leu Ala Gin Trp Arg His 
215 220 225 230 

egg ctt egg gtg ege tat ctt ect etg tat gag eee tct gcg get gag 775 
Arg Leu Arg Val Arg Tyr Leu Pro Leu Tyr Glu Pro Ser Ala Ala Glu 

235 240 245 

aag gtt gat gea gac ctt tat gcg egg aac gtg cgc gae gaa atg gcg 823 
Lys Val Asp Ala Asp Leu Tyr Ala Arg Asn Val Arg Asp Glu Met Ala 

250 255 260 

cgc gcg etc aag gta ccc act gtg gag cag tct tac cgc gac aag etc 871 
Arg Ala Leu Lys Val Pro Thr Val Glu Gin Ser Tyr Arg Asp Lys Leu 

265 270 275 

gte tae eae gcg gat etc atg ccg eae tac cag aag gee ggc eee gga 919 
Val Tyr His Ala Asp Leu Met Pro His Tyr Gin Lys Ala Gly Pro Gly 

280 285 290 

gcg etc tat etg tae gtc ega ect gae etc ttg tagcactcat gcgcgtccca 972 
Ala Leu Tyr Leu Tyr Val Arg Pro Asp Leu Leu 
295 300 305 

ageggteeag eaaegggaga ttaaaacacg atttcttagc ctacaaaaaa aaaaaaaaaa 1032 
aaaaaaaaaa aaaaa 1047 
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T 1 o 


vax 


Asp 


Arg 
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Pro 


Ser 


Axa 


Val 


Arg 


Asp 


Glu 








260 


Ser 


Tyr 


Arg 


Asp 






275 




Gin 


Lys 


Ala 


Gly 




290 






Leu 








305 









Thr Arg Ala Lys 

5 

He Val Phe Leu 

Ala Cys Thr Arg 
40 

Arg Cys Val Ala 
55 

Glu Val Ser Cys 

70 

Val Ala Asn His 
85 

Val His Cys Pro 

Leu Val Gly Tyr 
120 

Glu Gly Gly Gly 

135 

Glu Pro Pro Arg 
150 

Leu Val Phe Pro 

165 

Phe Lys Thr Gly 

Leu Glu Phe Pro 

200 

Glu Ser Val His 
215 

Arg His Arg Leu 
230 

Ala Glu Lys Val 
245 

Met Ala Arg Ala 

Lys Leu Val Tyr 
280 

Pro Gly Ala Leu 
295 



Thr Ala Val Gly 

10 

Val Thr Val Leu 
25 

Leu Gly Val Pro 

Arg Leu Thr Leu 
60 

Asp Ala Gin Gly 

75 

Val Ser Tyr Leu 
90 

Ser Phe Val Met 
105 

He Ala Met Glu 

Gin Ser Ala Ser 

140 

Asp Ser Ser Ser 
155 

Glu Gly Thr Thr 
170 

Ala Phe Arg Pro 
185 

He Asp Lys Ala 

Thr Pro Ala His 
220 

Arg Val Arg Tyr 

235 

Asp Ala Asp Leu 
250 

Leu Lys Val Pro 

265 

His Ala Asp Leu 

Tyr Leu Tyr Val 
300 



Leu 


Leu 


Thr 


Leu 






15 




Gly 


Thr 


Tyr 


Gly 










Lys 


Ser 


Phe 


Val 










lit-* 


Gly 


Leu Gly 


Leu 


Arg 


Glu 


Trp 








80 


Glu 


He 


Leu 


Tyr 






95 
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Thr 


Cys 




1 1 n 

± X u 






Leu 
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Gly Val 


1 ^ o 










J. JL C 


He 


Arg 


Glu 


Lys 


His 


His 
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i nr 


A c n 


Gly 


Ser 
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Pro 


Val 
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Arg 


VjX y 


Asp 


Phe 
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Leu 


Leu 


Arg 


Met 


Leu 


Pro 


Leu 


Tyr 








240 


Tyr 


Ala 


Arg 


Asn 






255 




Thr 


Val 


Glu 


Gin 




270 






Met 


Pro 


His 


Tyr 


285 








Arg 


Pro 


Asp 


Leu 
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<223> LPAAT 
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ggcacgaggg 
aggtatgtct 
aactgaaaac 
aggtccatca 
tttgcaattt 
cttcacttca 
ggatatggat 
gatggtcatg 
tcggaaaggc 
cttgtttggt 
ggatgctccc 
gctagtcgtg 
ttgatgtata 
agtaaatcat 
aatgtgattg 
gagaaaggct 
tgcctttctc 
aagcggtgtc 
cacattcgcc 
ctgtatgatc 
ttccctgaca 
gctctacacg 
aagctttatg 
cctaaaccgg 
agttgtaatt 
taagctgatt 
atacggttgt 
ctcctttacc 
cccaaaaaaa 



aaattggctt 
tattaatgaa 
ttgttttaat 
taatgtagca 
tcttttgctg 
ttcttatggc 
gttgtaaagg 
tgcaaccatc 
aagattgggt 
tgggcatttt 
gtcatcaaga 
tttcctgaag 
tctgcaatct 
acttgcttaa 
actactatga 
atccggagct 
aat cgcgctg 
ccttgttcat 
gaataccaat 
tattttatca 
gtggaattga 
tagtccttag 
tggctttcgc 
tttactctag 
ggtttgttta 
tttgctatta 
cttctgctga 
catcttgacc 
aaaaaaaaaa 



tctatgtggc 
aatgtctcca 
tttttcttaa 
atatgatcag 
ttaacctatt 
aatgaatgtc 
tgatttttgc 
gtaccgaagt 
actgcaagta 
acgtttttga 
catacattga 
gcacagattt 
tctctataat 
tgtacttagc 
cagtgaagcg 
tgtcaatgtg 
ctctttggat 
caacaatgta 
ttctgagatt 
aaaggaccag 
agagagccct 
cggttgggta 
tagtttgctg 
tctacgtact 
ttccgattcc 
attggctgct 
tgaacctcag 
agatgccaaa 
a 



cgtacttatt 
cacatgtatg 
attgaaattc 
aagcgctcaa 
gattatgttg 
gtgatgattc 
aggtgataaa 
ggactggatg 
tgcggtgaag 
gtttctgatg 
cagttttcaa 
ttcgtaaggc 
ctgcatttat 
aatttgtcat 
aaacgggaca 
cttcaacctc 
gcagtttatg 
ttcggaaccg 
cctcaatcag 
atgttggcca 
ttgaacatag 
ttctggtgct 
ctcgcgttta 
aaaagaaaaa 
aaagctgagt 
cccttgtttg 
tgcttcaaga 
ctcgcaataa 



cgaggaggtc 
ttgtttaggt 
tgtgcctgaa 
atgtgtcgtg 
gaaccacaat 
cggttaattt 
gtacctaagg 
tacatttgga 
aactcagtga 
ctgcatagaa 
gataaaagag 
tgaagtaccc 
tctctgttgt 
ttttgactta 
cgggcaatgc 
gcactcgtgg 
acctcactat 
atccatcgga 
aagacggtat 
gttttagtaa 
tggaaggtgt 
tgtttcattc 
gtacctattt 
tcgtgtaaaa 
ttaagggtga 
tctgccgtaa 
cgatgtggcc 
agcagatcaa 



aacgaaacaa 
atattctgtc 
agccaactct 
aaagtttgct 
acagacgctg 
catcctacag 
agaaccgtgt 
acttagcaat 
aaaacttacc 
agtgggaagt 
atcctctctg 
atccatggct 
ttctctagca 
ttgtgatgta 
aattggaaga 
ctttgtgact 
agggtacaag 
agtgcacatt 
gacgcagtgg 
gacaggctct 
ttgcaatgtt 
ggtttggttg 
tgattggaga 
taaattcgtt 
ggctcctctt 
attggcttta 
ttttagcctt 
taggtcgtgc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1701 
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atg get ttg atg tat ate tgc aat ctt 
Met Ala Leu Met Tyr lie Cys Asn Leu 
1 5 

tct gtt gtt tct eta gea agt aaa tea 
Ser Val Val Ser Leu Ala Ser Lys Ser 
20 25 

aat ttg tea ttt ttg act tat tgt gat 
Asn Leu Ser Phe Leu Thr Tyr Cys Asp 

35 40 
gac agt gaa gcg aaa egg gac acg ggc 
Asp Ser Glu Ala Lys Arg Asp Thr Gly 
50 55 



ete tat aat etg eat tta tte 48 
Leu Tyr Asn Leu His Leu Phe 
10 15 
tae ttg ctt aat gta ett age 96 
Tyr Leu Leu Asn Val Leu Ser 
30 

gta aat gtg att gac tac tat 144 
Val Asn Val lie Asp Tyr Tyr 
45 

aat gca att gga aga gag aaa 192 
Asn Ala lie Gly Arg Glu Lys 
60 
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etc 


act 


ata 


Qcra 
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aag 


aag 
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ate 


aac 


aat 


ata 


336 
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Gly 
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Arg 
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Pro 


Leu 
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He 
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Asn 


Val 










100 










105 










110 








ttc 
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acc 


gat 


cca 


teg 


gaa 


ato 

y \-y 


cac 


att 


cac 


att 


cgc 


ega 


ata 


cca 


384 


Phe 


Gly 


Thr 


Asp 


Pro 


Ser 


Glu 


Val 


His 


He 


His 


He 


Arg 


Arg 


He 


Pro 
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120 










125 










att 


tct 


aaa 

y ay 


att 


cct 


caa 


tea 


gaa 


gac 
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atg 


aeg 
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taa 
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tat 


432 


lie 


Ser 


Glu 


lie 
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Gin 


Ser 


Glu 
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Thr 


Gin 
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480 
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T VT 
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Ser 


Val 


Val 


Ser 
20 


Leu 
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Ser 
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25 
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Leu 


Leu 


Asn 


Val 
30 


Leu 


Ser 
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Leu 


Ser 
35 


Phe 
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Thr 


Tyr 
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40 


Asp 


Val 


Asn 


Val 


He 
45 


Asp 


Tyr 


Tyr 


Asp 


Ser 


Glu 


Ala 


Lys 


Arg 


Asp 


Thr 


Gly Asn Ala 


He 


Gly 


Arg 


Glu 


Lys 
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Glu 


Leu 


Val 


Asn 


Val 


Leu 


Gin 


Pro 


Arg 


Thr 


Arg 


Gly 


Phe 


65 










70 










75 










80 


Val 


Thr 


Cys 


Leu 


Ser 
85 


Gin 


Ser 


Arg 


Cys 


Ser 
90 


Leu 


Asp 


Ala 


Val 


Tyr 
95 


Asp 



6 



Leu Thr lie 

Phe Gly Thr 

115 

lie Ser Glu 
130 

Asp Leu Phe 
145 

Gly Ser Phe 

Glu Gly Val 

Phe Trp Cys 
195 

Ala Ser Leu 

210 
Pro Val Tyr 
225 



Gly Tyr 
100 

Asp Pro 

lie Pro 

Tyr Gin 

Pro Asp 
165 
Cys Asn 

180 

Leu Phe 
Leu Leu 
Ser Ser 



Lys Lys 

Ser Glu 

Gin Ser 
135 
Lys Asp 
150 

Ser Gly 
Val Ala 

His Ser 

Ala Phe 
: 215 
Leu Arg 
230 



Arg Cys 
105 
Val His 

120 

Glu Asp 
Gin Met 
He Glu 

Leu His 

185 
Val Trp 
200 

Ser Thr 
Thr Lys 



Pro Leu Phe 

He His He 

Gly Met Thr 
140 

Leu Ala Ser 
155 

Glu Ser Pro 
170 

Val Val Leu 

Leu Lys Leu 

Tyr Phe Asp 
220 

Arg Lys He 
235 



He Asn 
110 
Arg Arg 
125 

Gin Trp 

Phe Ser 

Leu Asn 

Ser Gly 
190 
Tyr Val 
205 

Trp Arg 
Val 



Asn Val 
He Pro 

Leu Tyr 

Lys Thr 
160 
He Val 
175 

Trp Val 
Ala Phe 
Pro Lys 
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<223> LPAAT 
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accaggtcga 
accggactga 
acaatatgtg 
taatcgcctt 
gataccctca 
tgttgcagct 
agccaactgt 
tggctaaagc 
tggctgcaga 



gatgcccatt 
tgctcagtct 
gtcccacgtc 
caaaacagga 
caagtatgtc 
gatgactcag 
gagagagatg 
gttaggcatc 
aaagctcaaa 



attggactgt 
aggcaccatg 
atgttgttcc 
gcattttcgc 
aacccctctt 
ttcatcaacc 
aaataccctc 
gtgtgcacag 
cagcctt 



ttttgcaagc 
cggctggcaa 
cggagggcac 
ctggtctccc 
ggtgtgacca 
acatggaggt 
atgaattcgc 
aacacagctt 



tttgcaaata 
cgttcggcga 
taccaccaat 
tgtgcagcca 
aggaggtccg 
tgaatatttg 
aagtagagtt 
tctggatatt 



atacccgtgg 
agggctgtgg 
ggcagagcaa 
atggttatta 
ttggtcgttg 
ccggtcatga 
cgcagcgaga 
aagctagcgc 



60 
120 
180 
240 
300 
360 
420 
480 
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<212> DNA 

<213> Physcomitrella patens 



<220> 

<221> CDS 

<222> (1)..(1566) 

<223> LPAAT 



<400> 
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atg gag age aca gca gat gtc gga atg tec gac gac gat cct ate ett 48 
Met Glu Ser Thr Ala Asp Val Gly Met Ser Asp Asp Asp Pro lie Leu 
15 10 15 

ete aae ggg ete gaa aeg eea eta etg get gaa ttt eet ett gge gaa 96 
Leu Asn Gly Leu Glu Thr Pro Leu Leu Ala Glu Phe Pro Leu Gly Glu 

20 25 30 

egg cct aca ata ggg ccg gag gca cca gta aat ccc ttc cat gaa ccc 14 4 

Arg Pro Thr lie Gly Pro Glu Ala Pro Val Asn Pro Phe His Glu Pro 

35 40 45 

gat ggt ggt tgg aag ace aae aae gag tgg aat tac ttt eaa atg atg 192 
Asp Gly Gly Trp Lys Thr Asn Asn Glu Trp Asn Tyr Phe Gin Met Met 

50 55 60 

aaa tee att ttg etg att eea ett ett ete gtt egt eta gtg age atg 240 
Lys Ser lie Leu Leu lie Pro Leu Leu Leu Val Arg Leu Val Ser Met 
65 70 75 80 

ata aca ate gta gea ttt gga tat gtg tgg ate agg att tgt etg ate 288 
He Thr He Val Ala Phe Gly Tyr Val Trp He Arg He Cys Leu He 

85 90 95 

gge gtc aca gat ccc ttg ttt aag cct ttc aat ccg tgt ega egg ttc 336 
Gly Val Thr Asp Pro Leu Phe Lys Pro Phe Asn Pro Cys Arg Arg Phe 

100 105 110 

atg etg tgg gge ata egg tta gta gca aga gca gtg atg ttt ace atg 384 
Met Leu Trp Gly He Arg Leu Val Ala Arg Ala Val Met Phe Thr Met 

115 120 125 

ggt tat tac tac att ccc ate aag gga aaa eeg get eae ega tea gag 432 
Gly Tyr Tyr Tyr He Pro He Lys Gly Lys Pro Ala His Arg Ser Glu 

130 135 140 

gcg ccc att att gtg tee aat eae att gga ttt etg gat ccc ate ttt 480 
Ala Pro He He Val Ser Asn His He Gly Phe Leu Asp Pro He Phe 
145 150 155 160 

gtg ttc tat egg eae ttg eeg gee ate gtc tea gee aag gag aae gtc 528 
Val Phe Tyr Arg His Leu Pro Ala He Val Ser Ala Lys Glu Asn Val 

165 170 175 

gag atg ccc att att gga etg ttt ttg caa get ttg caa ata ata ccc 576 
Glu Met Pro He He Gly Leu Phe Leu Gin Ala Leu Gin lie He Pro 

180 185 190 

gtg gac egg act gat get eag tet agg eae eae gcg get gge aae gtt 624 
Val Asp Arg Thr Asp Ala Gin Ser Arg His His Ala Ala Gly Asn Val 

195 200 205 

egg ega agg get gtg gac aat atg tgg tec eae gtc atg ttg ttc eeg 672 
Arg Arg Arg Ala Val Asp Asn Met Trp Ser His Val Met Leu Phe Pro 

210 215 220 

eag gge act acc ace aat gge aga gea ata ate gee ttc aaa aca gga 720 
Gin Gly Thr Thr Thr Asn Gly Arg Ala He He Ala Phe Lys Thr Gly 
225 230 235 240 

gca ttt teg cct ggt etc cct gtg eag cca atg gtt att aga tac cct 7 68 

Ala Phe Ser Pro Gly Leu Pro Val Gin Pro Met Val He Arg Tyr Pro 

245 250 255 

cac aag tat gtc aae ccc tet tgg tgt gac caa gga ggt ccg ttg gtc 816 
His Lys Tyr Val Asn Pro Ser Trp Cys Asp Gin Gly Gly Pro Leu Val 

260 265 270 

gtt gtg ttg eag etg atg act eag ttc ate aae eae atg gag gtt gaa 864 
Val Val Leu Gin Leu Met Thr Gin Phe He Asn His Met Glu Val Glu 

275 280 285 

tat ttg eeg gtc atg aag cca act gtg aga gag atg aaa tac eet cat 912 
Tyr Leu Pro Val Met Lys Pro Thr Val Arg Glu Met Lys Tyr Pro His 

290 295 300 
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gaa ttc gca agt aga gtt 
Glu Phe Ala Ser Arg Val 
305 310 
gtg tgc aca gaa cac age 
Val Cys Thr Glu His Ser 
325 

gaa aag etc aaa eag cct 
Glu Lys Leu Lys Gin Pro 
340 

atg gag aag tta ttt egg 
Met Glu Lys Leu Phe Arg 

355 

gaa aag ttc age gee atg 
Glu Lys Phe Ser Ala Met 
370 

gag gag tta tgt acg gca 
Glu Glu Leu Cys Thr Ala 
385 390 
eag gtg ttc aac ctt ttc 
Gin Val Phe Asn Leu Phe 
405 

cga gag ttt ttg gca ggg 
Arg Glu Phe Leu Ala Gly 
420 

tea agt aca atg gag get 
Ser Ser Thr Met Glu Ala 
435 

gge act ett tct cgt gat 
Gly Thr Leu Ser Arg Asp 
450 

eca gag etc cct cca ata 
Pro Glu Leu Pro Pro lie 
465 470 
ata aat cat gat gag aaa 
lie Asn His Asp Glu Lys 
485 

eag cga aac cca gag tat 
Gin Arg Asn Pro Glu Tyr 
500 

ctg ctg aag cca cec aca 
Leu Leu Lys Pro Pro Thr 
515 
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cgc age gag atg get aaa 
Arg Ser Glu Met Ala Lys 
315 

ttt ctg gat att aag eta 
Phe Leu Asp lie Lys Leu 
330 

tea ggt egg teg ttg gtt 
Ser Gly Arg Ser Leu Val 

345 

ctg gat ttt cct acg gcg 
Leu Asp Phe Pro Thr Ala 

360 

gac cgc aca cac agt gge 
Asp Arg Thr His Ser Gly 
375 380 
ctg gat ctt cca cgc tea 
Leu Asp Leu Pro Arg Ser 
395 

gat aag gat ggg cat gga 
Asp Lys Asp Gly His Gly 
410 

etc gee ttt gtg tec age 
Leu Ala Phe Val Ser Ser 
425 

gca ttt aaa gca tgt gat 
Ala Phe Lys Ala Cys Asp 
440 

gaa gtg gag agg agt ttg 
Glu Val Glu Arg Ser Leu 
455 460 
acg gtg ttc aag ctt ttt 
Thr Val Phe Lys Leu Phe 
475 

ate age tgg gag gag ttc 
lie Ser Trp Glu Glu Phe 
490 

ctg gee ate att ata tat 
Leu Ala lie lie lie Tyr 
505 

teg act age tga 
Ser Thr Ser 
520 



gcg tta gge ate 960 
Ala Leu Gly lie 
320 

gcg ctg get gca 1008 
Ala Leu Ala Ala 
335 

gag ttt get cgc 1056 
Glu Phe Ala Arg 
350 

aag gaa tac ttg 1104 
Lys Glu Tyr Leu 

365 

ttt gtt aca ttt 1152 
Phe Val Thr Phe 

cca att act aag 1200 
Pro lie Thr Lys 
400 

age ata aac ttt 1248 
Ser lie Asn Phe 
415 

cac aca tea ttt 1296 
His Thr Ser Phe 
430 

gtg aat gge gat 1344 

Val Asn Gly Asp 

445 

ctt gat ate ttt 1392 
Leu Asp lie Phe 

gac acg tta gat 1440 
Asp Thr Leu Asp 
480 

agt age ttt ctg 1488 
Ser Ser Phe Leu 
495 

gcg cac cct act 1536 
Ala His Pro Thr 
510 

1566 



<211> 521 



<212> PRT 



<213> Physcomitrella patens 
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Met 


Glu 


Ser Thr Ala 


Asp 


Val 


Gly 


Met 


Ser 


Asp 


Asp 


Asp 


Pro 


He 


Leu 


1 




5 










10 










15 




Leu 


Asn 


Gly Leu Glu 


Thr 


Pro 


Leu 


Leu 


Ala 


Glu 


Phe 


Pro 


Leu 


Gly 


Glu 






20 








25 










30 






Arg 


Pro 


Thr He Gly 


Pro 


Glu 


Ala 


Pro 


Val 


Asn 


Pro 


Phe 


His 


Glu 


Pro 



9 

















4 0 










4 5 








Asp 


Gly 


Gly 


TrD 

X X|^ 


Lys 


Thr 


Asn 


Asn 


Glu 


Tro 

X X 


Asn 


Tyr 


Phe 


Gin 


Met 


Met 




50 










55 










60 










Lys 


Ser 


lie 


Leu 


Leu 


lie 


Pro 


Leu 


Leu 


Leu 


Val 


Arg 


Leu 


Val 


Ser 


Met 


\j -J 










70 










75 










80 


lie 


Thr 


lie 


Val 


Ala 


Phe 


Gly 


Tvr 


Val 


Tro 


He Arg 


He 


Cys 


Leu 


He 










85 










90 










95 




\j J- y 


vox 


Thr 

1 I IX 




Pto 


Leu 


Phe 


j—i y o 


Pto 


Phe 


Asn 


Pro 


Cys 


Arg 


Arg 


Phe 








100 










105 










110 






Mp1- 


XjC Ll 


lip 




T 1 p 

X X c 


Arg 


XiC? Ll 


Val 


Ala 


Arg 


Ala 


Val 


Met 


Phe 


Thr 


Met 






X X <J 










120 










125 








y 


i yx 


1 yx 


Tvr* 
1 yx 


lie 


Pro 


lie 


Lys 


Gly 


Lys 


Pro 


Ala 


His 


Arg 


Ser 


Glu 




X J w 










135 










140 










Ala 


Pro 


lie 


lie 


Val 


Ser 


Asn 


His 


He 


Gly 


Phe 


Leu 


Asp 


Pro 


He 


Phe 


1 4 S 

± rt <J 










1 50 

X ^ \J 










155 










160 


Val 


Phe 


I yx 


Arg 


His 


Leu 


Pro 


Ala 


lie 


Val 


Ser 


Ala 


Lys 


Glu 


Asn 


Val 










1 fiR 

X vj J 










170 

X / VJ 










17 5 

X / J 




ox u 


Meat* 


Pto 

IT X *J 


Tip 

X X c 


T 1 P 

X X c 


f^l V 

ox 


Leu 


Phe 


Leu 


Gin 


Ala 


Leu 


Gin 


He 


He 


Pro 








1 ftO 
X o u 










1 fi5 

X O .J 










190 

X ^ w 






V d J. 


7\ oT-> 


rix y 


1 ilX 




Al 
rix d 


ni n 


^PT 
O X 


A TO 


nx o 


His 


Ala 


Al 

r\jL d 


ni M 

o X y 


Asn 


Val 






1 Q5 
X _/ J 










900 










90 5 








Airg 


Arcf 


Arg 


Ala 


Val 


Asp 


Asn 


Met 


Tro 


Ser 


His 


Val 


Met 


Leu 


Phe 


Pro 




^ X L/ 










91 5 










220 












\j JLy 


Th y 

X I IX 


Th T 

1 HI. 


1 1 1 X 


A c n 


(^1 \/ 
ox y 


A TO 

rAX y 


Al ^ 


He 


He 


Ala 


Php 


xjy o 


Thr 


r;i V 

ox y 


^ u 










9^0 
Z, J u 










235 










94 0 

Z. 'i U 


Al p 
raJ. d 


IT lie 


oc^x 


Pm 
IT X ^ 


ni V 

vjx y 


T .o n 
Xj C Li 


Pto 


1 

V dx 


OXil 


Pro 


Met 


Val 


Hp 

X X c 


Ato 
r^x y 


Tvr 
iyx 


Pro 

t X w 










94 5 










250 










955 

Z. ^ 




11 -L o 


X) _y o 


lyx 


Val 


A 


t X w 


O ~ X 


Tto 
1 xp 


Cys 


Asp Gin Gly 


Gly 


Pro 


Leu 


Val 








9 fiO 










965 

Z. v> •J 










970 






V dJ- 


V dX 


Xic u 


ftri n 
oxil 


XiC LI 




Th T 

J. 1 IX 


r;i n 

OXil 


Php 
rri ic 


He 


Asn 


His 


MpI- 

1 iC L. 


f;i n 

ox LI 


1 

V d X 


ni n 

ox LI 






9 7 5 










9fi 0 
z. o u 










9R 5 








i yi 


Leu. 


Pro 


Val 


Met 


Lys 


Pro 


Thr 


Val 


Arg 


Glu 


Met 


Lys 


Tvr 
X y X 


Pro 


His 




9 QO 










9 Q 5 










300 














Al ;^ 
r^x d 


O C X 




V;^ 1 
V dx 


A TO 
rtx y 


Spt 


O X LI 


Met 


Ala 


Lys 


Ala 


Leu 


Gly 


He 


O U J 










'^1 0 

J X L/ 










315 










'^90 


V ax 


^ y o 


Thr 


Glu 


His 


Ser 


Phe 


Leu 


Asp 


He 


Lys 


Leu 


Ala 


Leu 


Ala 


Ala 










^9 5 

~j -.J 










330 










335 




\jx l* 


T.\7Q 

Xj_y o 


T .on 

XiC U. 


Xi_y o 


C\ n 

OXii 


irx LJ 


QPT 
OCX 




Ato 


Ser 


Leu 


Val 


ox LI 


Phe 


Ala 


Arg 








'^4 0 
J 1 u 










"^4 5 

T 










350 








O X LX 


xj Y o 


Xj c u. 


Phe 


Arg 


Leu 


Asp 


Phe 


Pro 


Thr 


Ala 


Lys 


Glu 


Tvr 
X y I. 


Leu 






^5 5 
-J J J 










J VJ U 


















V3X \X 






C Q -p 

OCX 


Al 
r^xd 


Mpt- 


A <?o 


Arg 


Thr 


His 


Ser 


Gly 


Phe 


Val 


Thr 


Phe 














T7 5 










380 










Glu 


Glu 


Leu 


Cys 


Thr 


Ala 


Leu 


Asp 


Leu 


Pro 


Arg 


Ser 


Pro 


He 


Thr 


Lys 


^ O .J 




















395 










4 00 


Gin 


Val 


Phe 


Asn 


Leu 


Phe 


Asp 


Lys 


Asp 


Gly His 


Gly 


Ser 


He 


Asn 


Phe 










405 










410 










415 




Arg 


Glu 


Phe 


Leu 


Ala 


Gly 


Leu 


Ala 


Phe 


Val 


Ser 


Ser 


His 


Thr 


Ser 


Phe 








420 










425 










430 






Ser 


Ser 


Thr 


Met 


Glu 


Ala 


Ala 


Phe 


Lys 


Ala 


Cys 


Asp 


Val 


Asn 


Gly 


Asp 






4 5 

ri O O 










4 4 0 










44 5 

*i *i o 








Gly 


Thr 


Leu 


Ser 


Arg 


Asp 


Glu 


Val 


Glu Arg 


Ser 


Leu 


Leu 


Asp 


He 


Phe 




450 










455 










4 60 










Pro 


Glu 


Leu 


Pro 


Pro 


He 


Thr 


Val 


Phe 


Lys 


Leu 


Phe 


Asp 


Thr 


Leu 


Asp 


465 










470 










475 










480 


lie 


Asn 


His 


Asp 


Glu 


Lys 


He 


Ser 


Trp 


Glu 


Glu 


Phe 


Ser 


Ser 


Phe 


Leu 










485 










490 










495 





10 



Gin Arg Asn Pro Glu Tyr Leu Ala lie lie lie Tyr Ala His Pro Thr 

500 505 510 

Leu Leu Lys Pro Pro Thr Ser Thr Ser 
515 520 

<210> 9 
<211> 2217 
<212> DNA 

<213> Physcomitrella patens 

<220> 

<221> CDS 

<222> (281) . . (1837) 

<223> LPAAT2 

<400> 9 



ggcgcgccag 
tctgaatcat 
tcttggttac 

acgagtggct 
aagaatcgac 



aggacgagac 
caactaaggg 
ccaagactga 

ccgagttttt 
actgcacagg 



aaggggggcg 
tatggttata 
atgaagatgg 

tggtactcag 
cctcaccatc 



ctgtggactt 
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gaagtggaac 
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atg ttc aca gaa 
Met Phe Thr Glu 
10 

cat get gag ttt 
His Ala Glu Phe 

25 

cea gtg aac ccc 
Pro Val Asn Pro 

ate aag ace att 
He Lys Thr He 
60 

atg ttc get etc 
Met Phe Ala Leu 
75 

ate gga tgc aaa 
He Gly Cys Lys 
90 

etc ttg ttg gta 
Leu Leu Leu Val 

105 

atg ggg tat tac 
Met Gly Tyr Tyr 

gtg gcg ccc att 
Val Ala Pro He 
140 

ttt gtg ttc tat 
Phe Val Phe Tyr 
155 

gtg gag atg ccc 



gac 
Asp 

cct 
Pro 

ttc 

Phe 

45 

ctg 

Leu 



act age act 
Thr Ser Thr 
15 

ctt gga gag 
Leu Gly Glu 
30 

cae gag tec 
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atg atg 
Met Met 

gac ccg 
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Ser Val 
110 
tat ate 
Tyr He 
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He Val 
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<400> 18 

atg aac cct ate tac aag ggt ctg cga gcc att gtc tgg gcc ttt tac 48 
Met Asn Pro lie Tyr Lys Gly Leu Arg Ala lie Val Trp Ala Phe Tyr 
15 10 15 

ttc aac ctg gga gcg teg ctt ata teg ate aeg eag gtg ctg teg ctg 96 
Phe Asn Leu Gly Ala Ser Leu lie Ser lie Thr Gin Val Leu Ser Leu 

20 25 30 

cct ctg geg ttg att get eea ggg gtc tac eag tgg cac ate age aaa 144 
Pro Leu Ala Leu lie Ala Pro Gly Val Tyr Gin Trp His lie Ser Lys 

35 40 45 

aca eag ggt eac ttt gga get ttc ctg etc egg atg aac eag etc ttt 192 
Thr Gin Gly His Phe Gly Ala Phe Leu Leu Arg Met Asn Gin Leu Phe 

50 55 60 

gcg ccg tea gat att gtc ttg aca ggg gac gag agt gtc agg gga ate 240 
Ala Pro Ser Asp lie Val Leu Thr Gly Asp Glu Ser Val Arg Gly lie 
65 70 75 80 

gtc aag gtc tac aaa gga egg aac ctg aag gag gee ggt gag eca gge 288 
Val Lys Val Tyr Lys Gly Arg Asn Leu Lys Glu Ala Gly Glu Pro Gly 

85 90 95 

age ggt eag gga gag gac att ctt ctg gat atg ecc gag agg atg gtt 336 
Ser Gly Gin Gly Glu Asp lie Leu Leu Asp Met Pro Glu Arg Met Val 

100 105 110 

ttc att gcg aac cac eag ate tac tet gac tgg atg tac etc tgg tgc 384 
Phe lie Ala Asn His Gin lie Tyr Ser Asp Trp Met Tyr Leu Trp Cys 

115 120 125 

ttc tec tat ttt gca gag agg cac agg gca ctg aag att att ctt egg 432 
Phe Ser Tyr Phe Ala Glu Arg His Arg Ala Leu Lys lie lie Leu Arg 

130 135 140 

gge gac ctg acc tgg ate cct gtc ttt ggc tgg ggt atg egg ttc ttt 480 
Gly Asp Leu Thr Trp lie Pro Val Phe Gly Trp Gly Met Arg Phe Phe 
145 150 155 160 

gac ttt ate ttt ttg aaa cgt aat gac tgg gca cac gat cgc cgt gcc 528 
Asp Phe lie Phe Leu Lys Arg Asn Asp Trp Ala His Asp Arg Arg Ala 

165 170 175 

att gag gaa aac ttg gga cgt gtc aag gaa aag gat ecc etc tgg etc 576 
lie Glu Glu Asn Leu Gly Arg Val Lys Glu Lys Asp Pro Leu Trp Leu 

180 185 190 

gtg gtc ttc ecc gag gga aca gtc gtc tec aag gaa aeg cgt etc cga 624 
Val Val Phe Pro Glu Gly Thr Val Val Ser Lys Glu Thr Arg Leu Arg 

195 200 205 

tec gtt gcc ttt tea aag aag get agt ctg teg gat cac cgc eat gtg 672 
Ser Val Ala Phe Ser Lys Lys Ala Ser Leu Ser Asp His Arg His Val 

210 215 220 

ctg ctt eea agg ace age ggt ctg ttt gtg tgc ate aac aag ttg cgt 720 
Leu Leu Pro Arg Thr Ser Gly Leu Phe Val Cys lie Asn Lys Leu Arg 
225 230 235 240 

gga tet gtc gac tac ttg tac gat gca acc gtt ggc tac teg aat gtc 768 
Gly Ser Val Asp Tyr Leu Tyr Asp Ala Thr Val Gly Tyr Ser Asn Val 

245 250 255 

gag tat gge gag att ccg eag gag ctt tac ccg tta cca gga ctg tat 816 
Glu Tyr Gly Glu lie Pro Gin Glu Leu Tyr Pro Leu Pro Gly Leu Tyr 

260 265 270 

ate aac aaa gca eag cee aag gag ate aac atg cac ctg cgt cga ttt 864 
lie Asn Lys Ala Gin Pro Lys Glu lie Asn Met His Leu Arg Arg Phe 

275 280 285 

gcg ate aag gat ate ecc aeg tea gaa ecc gaa ttt gtg gaa tgg gtc 912 
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He Gly Trp Ser Met Trp Phe Ser Glu Tyr Val Phe Leu Ser Arg Asp 

130 135 140 

tgg gcc aaa gat gag aag gtc ttg aag aat ggt tat tea agt ctt aag 722 
Trp Ala Lys Asp Glu Lys Val Leu Lys Asn Gly Tyr Ser Ser Leu Lys 

145 150 155 

ggc ttc ccc agg acc ttg tgg gtg get ctt ttt gtg gaa ggc act cga 770 
Gly Phe Pro Arg Thr Leu Trp Val Ala Leu Phe Val Glu Gly Thr Arg 
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ttt acg aag gcc aaa ctt gag get gcc caa aaa ttt gca gcg gat aca 818 
Phe Thr Lys Ala Lys Leu Glu Ala Ala Gin Lys Phe Ala Ala Asp Thr 

180 185 190 

ggg eta egt gtt cea agg eat gtg ctt gtt ect cgc aca aaa ggg ttc 866 
Gly Leu Arg Val Pro Arg His Val Leu Val Pro Arg Thr Lys Gly Phe 

195 200 205 

gtt teg get gtg gag aac ttg cgt gaa ttt gtt eeg gta gtt tat gac 914 
Val Ser Ala Val Glu Asn Leu Arg Glu Phe Val Pro Val Val Tyr Asp 

210 215 220 

atg ace gtt get ata tet aaa gag etg ccc aat eet aca atg ate egg 962 
Met Thr Val Ala He Ser Lys Glu Leu Pro Asn Pro Thr Met lie Arg 

225 230 235 

att ttc aga ggg eaa cea tet gtg gtt eat gtg eae gtg aga egg gtc 1010 
He Phe Arg Gly Gin Pro Ser Val Val His Val His Val Arg Arg Val 
240 245 250 255 

eet atg tet gat etg ect gag gga gee aac gcg att tet aaa tgg tgt 1058 
Pro Met Ser Asp Leu Pro Glu Gly Ala Asn Ala He Ser Lys Trp Cys 

260 265 270 

cac gat gee ttt cac ate aag gae gat egg etg gag eag eae gaa aaa 1106 
His Asp Ala Phe His He Lys Asp Asp Arg Leu Glu Gin His Glu Lys 
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gag aat acg ttt ggg gag gac ttg tat att eet att gaa egg cea ctt 1154 
Glu Asn Thr Phe Gly Glu Asp Leu Tyr He Pro He Glu Arg Pro Leu 
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aaa ect ctt att att gtg ate tee tgg gee ate act ttg etg get gca 1202 
Lys Pro Leu He He Val He Ser Trp Ala He Thr Leu Leu Ala Ala 
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gca tgg tgg ttt eta aga cga gtt tta tec act tgg aaa gga ate gee 1250 
Ala Trp Trp Phe Leu Arg Arg Val Leu Ser Thr Trp Lys Gly He Ala 
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tgg gtg gca gga gta etc gtg gtc gtc atg etg tgt gtc eag att tta 1298 
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accttgtagt ecacctaatt tcettgaaeg attaagagea geggeeatta gatgattega 1574 
tttggtttct tgatagtatc tggtaccttc ttetteaage attgtgtatt eegetteage 1634 
cattcctttt tttaagatgt attgcttctc gttcgagggt aggtcatttc tgatctaatt 1694 
ttgaaagcac taattc 1710 
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<213> Physcomitrella patens 
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<221> CDS 

<222> (58) . . (930) 

<223> Delta-6-elongase 
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<212> DNA 

<213> Phaeodactylum tricornutum 
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<221> CDS 

<222> (1) . . (1410) 

<223> Delta-5-desaturase 
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cgt 


CCC 


taa 


gtc 


aac 


aac 


atq 


eta 


qqc 


576 


Asn 


His 


Gly 


Ala 


Thr 


Ser 


Lys 


Arg 


Pro 


Tro 


Val 


Asn 


Asp 


Met 


Leu 


Glv 










180 










185 










190 








etc 


ggt 


aco 


gat 


ttt 


att 


aat 


aat 


tec 


aaa 


tqq 


etc 


tqq 


caq 


qaa 


caa 


624 


Leu Gly 


Ala 


Asp 


Phe 


lie 


Gly 


Gly 


Ser 


Lys 


TrD 


Leu 


TrD 


Gin 


Glu 


Gin 








195 










200 










205 










cac 


tgg 


acc 


cac 


cac 


get 


tac 


acc 


aat 


cac 


ace 


aaa 

y uy 


atq 


qat 


ccc 


qat 


672 


His 


Trp 


Thr 


His 


His 


Ala 


Tvr 


Thr 


Asn 


His 


Ala 


Glu 


Met 


Asp 


Pro 


Asp 






210 










215 










220 












age 


ttt 


aat 


gcc 


gaa 


cca 


atg 


etc 


eta 


ttc 


aac 


aac 


tat 


ccc 


ttq 


qat 


720 


Ser 


Phe 


Gly 


Ala 


Glu 


Pro 


Met 


Leu 


Leu 


Phe 


Asn 


Asp 


Tvr 


Pro 


Leu 


Asp 




225 










230 










235 










240 




cat 


ccc 


get 


cgt 


acc 


taa 


eta 


cat 


cgc 


ttt 


caa 


qca 


ttc 


ttt 


tac 


atg 


768 


His 


Pro 


Ala 


Arg 


Thr 


TrD 


Leu 


His 


Arg 


Phe 


Gin 


Ala 


Phe 


Phe 


Tvr 


Met 












245 










250 










255 






ccc 


gtc 


tta 

u uy 


get 


aaa 


tac 


taa 
^ ^ 


ttq 


tec 


get 


gtc 


ttc 


aat 


cca 


caa 


att 


816 


Pro 


Val 


Leu 


Ala 


Glv 


Tvr 


Tro 


Leu 


Ser 


Ala 


Val 


Phe 


Asn 


Pro 


Gin 


He 










260 










265 










270 








ctt 


gac 


etc 


cag 


caa 


cac 


aac 


qca 
^ ^ ^ 


ctt 


tec 


qtc 


qqt 


ate 


cqt 


etc 


qac 


864 


Leu 


Asp 


Leu 


Gin 


Gin 


Arg 


Gly 


Ala 


Leu 


Ser 


Val 


Gly 


He 


Arg 


Leu 


Asp 








275 










280 










285 










aac 


get 


ttc 


att 


cac 


tea 


caa 


cqc 


aag 


tat 


acq 


qtt 


ttc 


tqq 


cqq 


get 


912 


Asn 


Ala 


Phe 


lie 


His 


Ser 


Arg 


Arg 


Lys 


Tvr 


Ala 


Val 


Phe 


TrD 


Arg 


Ala 






290 










295 










300 












gtg 


tac 


att 


aca 


ata 


aac 


ata 


att 


act 


ccq 


ttt 


tac 


aca 


aac 


tec 


qqc 


960 


Val 


Tyr 


lie 


Ala 


Val 


Asn 


Val 


lie 


Ala 


Pro 


Phe 


Tvr 


Thr 


Asn 


Ser 


Glv 




305 










310 










315 










320 




etc 


gaa 


tag 


tec 


tgg 


cgt 


gtc 


ttt 


gga 


aac 


ate 


atg 


etc 


atg 


ggt 


gtg 


1008 


Leu 


Glu 


Trp 


Ser 


Trp 


Arq 


Val 


Phe 


Gly 


Asn 


He 


Met 


Leu 


Met 


Gly 


Val 












325 










330 










335 






gcg 


gaa 


teg 


etc 


gcg 


etg 


gcg 


gtc 


etg 


ttt 


teg 


ttg 


teg 


cac 


aat 


ttc 


1056 


Ala 


Glu 


Ser 


Leu 


Ala 


Leu 


Ala 


Val 


Leu 


Phe 


Ser 


Leu 


Ser 


His 


Asn 


Phe 










340 










345 










350 








gaa 


tec 


gcg 


gat 


cgc 


gat 


ceg 


acc 


gcc 


cca 


Gtg 


aaa 


aag 


acg 


gga 


gaa 


1104 


Glu 


Ser 


Ala 


Asp 


Arg 


Asp 


Pro 


Thr 


Ala 


Pro 


Leu 


Lys 


Lys 


Thr 


Gly 


Glu 








355 










360 










365 










cca 


gtc 


gac 


tgg 


ttc 


aag 


aca 


cag 


gtc 


gaa 


act 


tee 


tge 


act 


tac 


ggt 


1152 



59 



Pro 


Val 


Asp 


Tro 


Phe 


Lys 


Thr 


Gin 


Val 


Glu 


Thr 


Ser 


Cvs 


Thr 


Tvr 


Glv 






370 










375 










380 












QQ3. 

y y " 


ttc 


ctt 


tec 


aat 

1. 


tgc 


ttc 


aca 


gga 


ggt 


etc 


aac 


ttt 


caq 


gtt 
^ ^ 


qaa 


1200 


Glv 


Phe 


Leu 


Ser 


Glv 


Cys 


Phe 


Thr 


Gly Gly 


Leu 


Asn 


Phe 


Gin 


Val 


Glu 




385 










390 










395 










400 




cac 


cac 


tta 


ttc 


cca 


cgc 


at g 


age 


age 


get 


taa 


tat 


ccc 


tac 


att 


acc 


1248 


His 


His 


Leu 


Phe 


Pro 


Arg 


Met 


Ser 


Ser 


Ala 


Tro 


Tvr 


Pro 


Tvr 


lie 


Ala 












405 










410 










415 






ccc 


aag 


ate 


cac 


aaa 


att 


tac 


QCC 


aaa 


cac 


qqc 


qtc 


cac 


tac 


gcc 


tac 


1296 


Pro 


Lys 


Val 


Arg 


Glu 


lie 


Cys 


Ala 


Lys 


His 


Glv 


Val 


His 


Tvr 


Ala 


Tvr 










420 










425 










430 








tac 


ccg 


taq 


ate 


cac 


caa 


aac 


ttt 


etc 


tec 


acc 


gtc 


cgc 


tac 


atg 


cac 


1344 


Tvr 


Pro 


Trp 


lie 


His 


Gin 


Asn 


Phe 


Leu 


Ser 


Thr 


Val 


Arg 


Tyr 


Met 


His 








435 










440 










445 










gcg 


gcc 


ggg 


acc 


ggt 


gcc 


aac 


tgg 


cgc 


cag 


atg 


gcc 


aga 


gaa 


aat 


ccc 


1392 


Ala 


Ala 


Gly 


Thr 


Gly 


Ala 


Asn 


Trp 


Arg 


Gin 


Met 


Ala 


Arg 


Glu 


Asn 


Pro 






450 










455 










4 60 












ttg 


acc 


gga 


egg 


gcg 


taa 






















1410 


Leu 


Thr 


Gly 


Arg 


Ala 



























465 

<210> 51 

<211> 469 

<212> PRT 

<213> Phaeodactylum tricornutum 

<400> 51 



Met 


Ala 


Pro 


Asp 


Ala 


Asp 


Lys 


Leu 


Arg Gin Arg 


Gin 


Thr 


Thr 


Ala 


Val 


1 








5 








10 








15 




Ala 


Lys 


His 


Asn 
20 


Ala 


Ala 


Thr 


He 


Ser Thr Gin 
25 


Glu 


Arg 


Leu 
30 


Cys 


Ser 


Leu 


Ser 


Ser 
35 


Leu 


Lys 


Gly 


Glu 


Glu 
40 


Val Cys He 


Asp 


Gly 
45 


He 


He 


Tyr 


Asp 


Leu 
50 


Gin 


Ser 


Phe 


Asp 


His 
55 


Pro 


Gly Gly Glu 


Thr 
60 


He 


Lys 


Met 


Phe 


Gly 


Gly 


Asn 


Asp 


Val 


Thr 


Val 


Gin 


Tyr Lys Met 


He 


His 


Pro 


Tyr 


His 


65 










70 






75 










80 


Thr 


Glu 


Lys 


His 


Leu 
85 


Glu 


Lys 


Met 


Lys Arg Val 
90 


Gly 


Lys 


Val 


Thr 
95 


Asp 


Phe 


Val 


Cys 


Glu 
100 


Tyr 


Lys 


Phe 


Asp 


Thr Glu Phe 
105 


Glu 


Arg 


Glu 
110 


He 


Lys 


Arg 


Glu 


Val 
115 


Phe 


Lys 


He 


Val 


Arg 
120 


Arg Gly Lys 


Asp 


Phe 
125 


Gly 


Thr 


Leu 


Gly 


Trp 

130 


Phe 


Phe 


Arg 


Ala 


Phe 

135 


Cys 


Tyr He Ala 


He 

140 


Phe 


Phe 


Tyr 


Leu 


Gin 


Tyr 


His 


Trp 


Val 


Thr 


Thr 


Gly 


Thr Ser Trp 


Leu 


Leu 


Ala 


Val 


Ala 


145 










150 






155 










160 


Tyr 


Gly 


He 


Ser 


Gin 
165 


Ala 


Met 


He 


Gly Met Asn 
170 


Val 


Gin 


His 


Asp 
175 


Ala 


Asn 


His 


Gly 


Ala 
180 


Thr 


Ser 


Lys 


Arg 


Pro Trp Val 
185 


Asn 


Asp 


Met 
190 


Leu 


Gly 


Leu 


Gly 


Ala 

195 


Asp 


Phe 


He 


Gly 


Gly 

200 


Ser Lys Trp 


Leu 


Trp 
205 


Gin 


Glu 


Gin 


His 


Trp 
210 


Thr 


His 


His 


Ala 


Tyr 
215 


Thr 


Asn His Ala 


Glu 
220 


Met 


Asp 


Pro 


Asp 



60 



Ser Phe Gly Ala Glu Pro Met Leu Leu Phe Asn Asp Tyr Pro Leu Asp 



225 










230 










235 










240 


His 


Pro 


Ala 


Arg 


Thr 


Tro 


Leu 


His 


Ara 


Phe 


Gin 


Ala 


Phe 


Phe 


Tvr 

X y j- 


Met 










245 










250 










255 




Pro 


Val 


Leu 


Ala 


Gly 


Tvr 


Tro 


Leu 


Ser 


Ala 


Val 


Phe 


Asn 


Pro 


Gin 


He 








260 










265 










270 






Leu 


Asp 


Leu 


Gin 


Gin 


Arg 


Glv 


Ala 


Leu 


Ser 


Val 


Glv 


He 


Arg 


Leu 


Asp 






275 










280 










285 








Asn 


Ala 


Phe 


lie 


His 


Ser 


Arg 


Arg 


Lys 


Tvr 


Ala 


Val 


Phe 


Trn 


Arg 


Ala 




290 










295 










300 










Val 


Tvr 

X _y J. 


lie 


Ala 


Val 


Asn 


Val 


He 


Ala 


Pro 


Phe 


Tvr 

X y J. 


Thr 


Asn 


Ser 


Gly 


305 










310 










315 










320 


Lbu 


Glu 




Ser 


Tro 


Arg 


Val 


Phe 


Gly 


Asn 


He 


Met 


Leu 


Met 


Gly 


Val 










325 










330 










335 




Ala 


Glu 


Ser 


Leu 


Ala 


Leu 


Ala 


Val 


Leu 


Phe 


Ser 


Leu 


Ser 


His 


Asn 


Phe 








340 










345 










350 






Glu 


Ser 


Ala 


Asp 


Arg 


Asp 


Pro 


Thr 


Ala 


Pro 


Leu 


Lys 


Lys 


Thr 


Gly 


Glu 






355 










360 










365 








Pro 


Val 


Asp 


Trn 


Phe 


Lys 


Thr 


Gin 


Val 


Glu 


Thr 


Ser 


Cys 


Thr 


T VT 


Gly 




370 










375 










380 










Gly 


Phe 


Leu 


Ser 


Gly 


Cys 


Phe 


Thr 


Gly 


Gly 


Leu 


Asn 


Phe 


Gin 


Val 


Glu 


385 










390 










395 










400 


His 


His 


Leu 


Phe 


Pro 


Arg 


Met 


Ser 


Ser 


Ala 


1 




Pro 


i yi. 


Hp 

X J. V> 


Ala 










405 










410 










415 




Pro 


Lys 


Val 


Arg 


Glu 


He 


Cys 


Ala 


Lys 


His 


Gly 


Val 


His 


Tyr 


Ala 


Tyr 








420 










425 










430 






Tyr 


Pro 


Trp 


He 


His 


Gin 


Asn 


Phe 


Leu 


Ser 


Thr 


Val 


Arg 


Tyr 


Met 


His 






435 










440 










445 








Ala 


Ala 


Gly 


Thr 


Gly 


Ala 


Asn 


Trp 


Arg 


Gin 


Met 


Ala 


Arg 


Glu 


Asn 


Pro 




450 










455 










4 60 










Leu 


Thr 


Gly 


Arg 


Ala 

























465 



<210> 52 

<211> 3598 

<212> DNA 

<213> artificial sequence 

<220> 

<221> misc_feature 

<223> Sequence represents a plant promoter-terminator expression cassette 
in vector pUC19 



<400> 



52 



tcgcgcgttt 
cagcttgtct 
ttggcgggtg 
accatatgcg 
attcgccatt 
tacgccagct 
tttcccagtc 
gcaaatttac 
gtgtgttatg 
tttatgctaa 
tttttgtctt 



cggtgatgac 
gtaagcggat 
tcggggctgg 
gtgtgaaata 
caggctgcgc 
ggcgaaaggg 
acgacgttgt 
acattgccac 
tatttgattt 
cgtttgccaa 
ctaaatacat 



ggtgaaaacc 
gccgggagca 
cttaactatg 
ccgcacagat 
aactgttggg 
ggatgtgctg 
aaaacgacgg 
taaacgtcta 
gcgataaatt 
cacttagcaa 
atactaatca 



tctgacacat 
gacaagcccg 
cggcatcaga 
gcgtaaggag 
aagggcgatc 
caaggcgatt 
ccagtgaatt 
aacccttgta 
tttatatttg 
tttgcaagtt 
actggaaatg 



gcagctcccg 
tcagggcgcg 
gcagattgta 
aaaataccgc 
ggtgcgggcc 
aagttgggta 
cggcgcgccg 
atttgttttt 
gtactaaatt 
gattaattga 
taaatatttg 



gagacggtca 
tcagcgggtg 
ctgagagtgc 
atcaggcgcc 
tcttcgctat 
acgccagggt 
agctcctcga 
gttttactat 
tataacacct 
ttctaaatta 
ctaatatttc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



61 

tactatagga gaattaaagt gagtgaatat ggtaccacaa ggtttggaga tttaattgtt 720 

gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataataatgg 780 

taccacacaa gatttgaggt gcatgaacgt cacgtggaca aaaggtttag taatttttca 840 

agacaacaat gttaccacac acaagttttg aggtgcatgc atggatgccc tgtggaaagt 900 

ttaaaaatat tttggaaatg atttgcatgg aagccatgtg taaaaccatg acatccactt 960 

ggaggatgca ataatgaaga aaactacaaa tttacatgca actagttatg catgtagtct 1020 

atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 1080 

taatttcttc atagccagcc caccgcggtg ggcggccgcc tgcagtctag aaggcctcct 1140 

gctttaatga gatatgcgag acgcctatga tcgcatgata tttgctttca attctgttgt 1200 

gcacgttgta aaaaacctga gcatgtgtag ctcagatcct taccgccggt ttcggttcat 1260 

tctaatgaat atatcacccg ttactatcgt atttttatga ataatattct ccgttcaatt 1320 

tactgattgt ccgtcgacga attcgagctc ggcgcgccaa gcttggcgta atcatggtca 1380 

tagctgtttc ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat acgagccgga 1440 

agcataaagt gtaaagcctg gggtgcctaa tgagtgagct aactcacatt aattgcgttg 1500 

cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc 1560 

caacgcgcgg ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac 1620 

tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata 1680 

cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa 174 0 

aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct 1800 

gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa 1860 

agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg 1920 

cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca 1980 

cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa 2040 

ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg 2100 

gtaagacacg acttatcgcc actggcagca gccactggta acaggattag cagagcgagg 2160 

tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta cactagaagg 2220 

acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc 2280 

tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag 2340 

attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac 2400 

gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc 24 60 

ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag 2520 

taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 2580 

ctatttcgtt catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag 2640 

ggcttaccat ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca 2700 

gatttatcag caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact 2760 

ttatccgcct ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca 2820 

gttaatagtt tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg 2880 

tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc 2940 

atgttgtgca aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg 3000 

gccgcagtgt tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca 3060 

tccgtaagat gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt 3120 

atgcggcgac cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc 3180 

agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc 3240 

ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca 3300 

tcttttactt tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa 3360 

aagggaataa gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat 3420 

tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa 3480 

aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga cgtctaagaa 3540 

accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc ctttcgtc 3598 

<210> 53 

<211> 3590 
<212> DNA 

<213> artificial sequence 



<220> 



62 

<221> misc_f eature 

<223> Sequence represents a plant promoter-terminator expression cassette 
in vector pUC19 

<400> 53 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240 

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300 

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360 

tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cggcgcgccg agctcctcga 420 

gcaaatttac acattgccac taaacgtcta aacccttgta atttgttttt gttttactat 480 

gtgtgttatg tatttgattt gcgataaatt tttatatttg gtactaaatt tataacacct 540 

tttatgctaa cgtttgccaa cacttagcaa tttgcaagtt gattaattga ttctaaatta 600 

tttttgtctt ctaaatacat atactaatca actggaaatg taaatatttg ctaatatttc 660 

tactatagga gaattaaagt gagtgaatat ggtaccacaa ggtttggaga tttaattgtt 720 

gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataataatgg 780 

taccacacaa gatttgaggt gcatgaacgt cacgtggaca aaaggtttag taatttttca 840 

agacaacaat gttaccacac acaagttttg aggtgcatgc atggatgccc tgtggaaagt 900 

ttaaaaatat tttggaaatg atttgcatgg aagccatgtg taaaaccatg acatccactt 960 

ggaggatgca ataatgaaga aaactacaaa tttacatgca actagttatg catgtagtct 1020 

atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 1080 

taatttcttc atagccagcg gatccgatat cgggcccgct agcgttaacc ctgctttaat 1140 

gagatatgcg agacgcctat gatcgcatga tatttgcttt caattctgtt gtgcacgttg 1200 

taaaaaacct gagcatgtgt agctcagatc cttaccgccg gtttcggttc attctaatga 1260 

atatatcacc cgttactatc gtatttttat gaataatatt ctccgttcaa tttactgatt 1320 

gtccgtcgac gaattcgagc tcggcgcgcc aagcttggcg taatcatggt catagctgtt 1380 

tcctgtgtga aattgttatc cgctcacaat tccacacaac atacgagccg gaagcataaa 1440 

gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca ttaattgcgt tgcgctcact 1500 

gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg gccaacgcgc 1560 

ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg 1620 

ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc 1680 

cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag 1740 

gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca 1800 

tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca 1860 

ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg 1920 

atacctgtcc gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag 1980 

gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt 2040 

tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca 2100 

cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg 2160 

cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa ggacagtatt 2220 

tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc 2280 

cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg 2340 

cagaaaaaaa ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg 2400 

gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta 2460 

gatcctttta aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg 2520 

gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg 2580 

ttcatccata gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc 2640 

atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc 2700 

agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc 2760 

ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag 2820 

tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat 2880 

ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg 2940 

caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt 3000 

gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag 3060 



63 



atgcttttct 
accgagttgc 
aaaagtgctc 
gttgagatcc 
tttcaccagc 
aagggcgaca 
ttatcagggt 
aataggggtt 
tatcatgaca 



gtgactggtg 
tcttgcccgg 
atcattggaa 
agttcgatgt 
gtttctgggt 
cggaaatgtt 
tattgtctca 
ccgcgcacat 
ttaacctata 



agtactcaac 
cgtcaatacg 
aacgttcttc 
aacccactcg 
gagcaaaaac 
gaatactcat 
tgagcggata 
ttccccgaaa 
aaaataggcg 



caagtcattc 
ggataatacc 
ggggcgaaaa 
tgcacccaac 
aggaaggcaa 
actcttcctt 
catatttgaa 
agtgccacct 
tatcacgagg 



tgagaatagt 
gcgccacata 
ctctcaagga 
tgatcttcag 
aatgccgcaa 
tttcaatatt 
tgtatttaga 
gacgtctaag 
ccctttcgtc 



gtatgcggcg 
gcagaacttt 
tcttaccgct 
catcttttac 
aaaagggaat 
attgaagcat 
aaaataaaca 
aaaccattat 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3590 



<210> 


54 


<211> 


3584 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<221> 


misc_f eature 


<223> 


Sequence represents 




in vector pUC19 


<400> 


54 



tcgcgcgttt 
cagcttgtct 
ttggcgggtg 
accatatgcg 
attcgccatt 
tacgccagct 
tttcccagtc 
gcaaatttac 
gtgtgttatg 
tttatgctaa 
tttttgtctt 
tactatagga 
gcaatgctgc 
taccacacaa 
agacaacaat 
ttaaaaatat 
ggaggatgca 
atataatgag 
taatttcttc 
gagatatgcg 
taaaaaacct 
atatatcacc 
gtccgtcgac 
gtgaaattgt 
agcctggggt 
tttccagtcg 
aggcggtttg 
cgttcggctg 
atcaggggat 
taaaaaggcc 
aaatcgacgc 
tccccctgga 
gtccgccttt 
cagttcggtg 



cggtgatgac 
gtaagcggat 
tcggggctgg 
gtgtgaaata 
caggctgcgc 
ggcgaaaggg 
acgacgttgt 
acattgccac 
tatttgattt 
cgtttgccaa 
ctaaatacat 
gaattaaagt 
atggatggca 
gatttgaggt 
gttaccacac 
tttggaaatg 
ataatgaaga 
gattttgcaa 
atagccagca 
agacgcctat 
gagcatgtgt 
cgttactatc 
gagctcggcg 
tatccgctca 
gcctaatgag 
ggaaacctgt 
cgtattgggc 
cggcgagcgg 
aacgcaggaa 
gcgttgctgg 
tcaagtcaga 
agctccctcg 
ctcccttcgg 
taggtcgttc 



ggtgaaaacc 
gccgggagca 
cttaactatg 
ccgcacagat 
aactgttggg 
ggatgtgctg 
aaaacgacgg 
taaacgtcta 
gcgataaatt 
cacttagcaa 
atactaatca 
gagtgaatat 
tatacaccaa 
gcatgaacgt 
acaagttttg 
atttgcatgg 
aaactacaaa 
tactttcatt 
gatctgccgg 
gatcgcatga 
agctcagatc 
gtatttttat 
cgccaagctt 
caattccaca 
tgagctaact 
cgtgccagct 
gctcttccgc 
tatcagctca 
agaacatgtg 
cgtttttcca 
ggtggcgaaa 
tgcgctctcc 
gaagcgtggc 
gctccaagct 



tctgacacat 
gacaagcccg 
cggcatcaga 
gcgtaaggag 
aagggcgatc 
caaggcgatt 
ccagtgaatt 
aacccttgta 
tttatatttg 
tttgcaagtt 
actggaaatg 
ggtaccacaa 
acattcaata 
cacgtggaca 
aggtgcatgc 
aagccatgtg 
tttacatgca 
catacacact 
catcgatccc 
tatttgcttt 
cttaccgccg 
gaataatatt 
ggcgtaatca 
caacatacga 
cacattaatt 
gcattaatga 
ttcctcgctc 
ctcaaaggcg 
agcaaaaggc 
taggctccgc 
cccgacagga 
tgttccgacc 
gctttctcat 
gggctgtgtg 



gcagctcccg 
tcagggcgcg 
gcagattgta 
aaaataccgc 
ggtgcgggcc 
aagttgggta 
cggcgcgccg 
atttgttttt 
gtactaaatt 
gattaattga 
taaatatttg 
ggtttggaga 
attcttgagg 
aaaggtttag 
atggatgccc 
taaaaccatg 
actagttatg 
cactaagttt 
gggccatggc 
caattctgtt 
gtttcggttc 
ctccgttcaa 
tggtcatagc 
gccggaagca 
gcgttgcgct 
atcggccaac 
actgactcgc 
gtaatacggt 
cagcaaaagg 
ccccctgacg 
ctataaagat 
ctgccgctta 
agctcacgct 
cacgaacccc 



gagacggtca 
tcagcgggtg 
ctgagagtgc 
atcaggcgcc 
tcttcgctat 
acgccagggt 
agctcctcga 
gttttactat 
tataacacct 
ttctaaatta 
ctaatatttc 
tttaattgtt 
ataataatgg 
taatttttca 
tgtggaaagt 
acatccactt 
catgtagtct 
tacacgatta 
ctgctttaat 
gtgcacgttg 
attctaatga 
tttactgatt 
tgtttcctgt 
taaagtgtaa 
cactgcccgc 
gcgcggggag 
tgcgctcggt 
tatccacaga 
ccaggaaccg 
agcatcacaa 
accaggcgtt 
ccggatacct 
gtaggtatct 
ccgttcagcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



64 



cgaccgctgc 
atcgccactg 
tacagagttc 
ctgcgctctg 
acaaaccacc 
aaaaggatct 
aaactcacgt 
tttaaattaa 
cagttaccaa 
catagttgcc 
ccccagtgct 
aaaccagcca 
ccagtctatt 
caacgttgtt 
attcagctcc 
agcggttagc 
actcatggtt 
ttctgtgact 
ttgctcttgc 
gctcatcatt 
atccagttcg 
cagcgtttct 
gacacggaaa 
gggttattgt 
ggttccgcgc 
gacattaacc 



gccttatccg 
gcagcagcca 
ttgaagtggt 
ctgaagccag 
gctggtagcg 
caagaagatc 
taagggattt 
aaatgaagtt 
tgcttaatca 
tgactccccg 
gcaatgatac 
gccggaaggg 
aattgttgcc 
gccattgcta 
ggttcccaac 
tccttcggtc 
atggcagcac 
ggtgagtact 
ccggcgtcaa 
ggaaaacgtt 
atgtaaccca 
gggtgagcaa 
tgttgaatac 
ctcatgagcg 
acatttcccc 
tataaaaata 



gtaactatcg 
ctggtaacag 
ggcctaacta 
ttaccttcgg 
gtggtttttt 
ctttgatctt 
tggtcatgag 
ttaaatcaat 
gtgaggcacc 
tcgtgtagat 
cgcgagaccc 
ccgagcgcag 
gggaagctag 
caggcatcgt 
gatcaaggcg 
ctccgatcgt 
tgcataattc 
caaccaagtc 
tacgggataa 
cttcggggcg 
ctcgtgcacc 
aaacaggaag 
tcatactctt 
gatacatatt 
gaaaagtgcc 
ggcgtatcac 



tcttgagtcc 
gattagcaga 
cggctacact 
aaaaagagtt 
tgtttgcaag 
ttctacgggg 
attatcaaaa 
ctaaagtata 
tatctcagcg 
aactacgata 
acgctcaccg 
aagtggtcct 
agtaagtagt 
ggtgtcacgc 
agttacatga 
tgtcagaagt 
tcttactgtc 
attctgagaa 
taccgcgcca 
aaaactctca 
caactgatct 
gcaaaatgcc 
cctttttcaa 
tgaatgtatt 
acctgacgtc 
gaggcccttt 



aacccggtaa 
gcgaggtatg 
agaaggacag 
ggtagctctt 
cagcagatta 
tctgacgctc 
aggatcttca 
tatgagtaaa 
atctgtctat 
cgggagggct 
gctccagatt 
gcaactttat 
tcgccagtta 
tcgtcgtttg 
tcccccatgt 
aagttggccg 
atgccatccg 
tagtgtatgc 
catagcagaa 
aggatcttac 
tcagcatctt 
gcaaaaaagg 
tattattgaa 
tagaaaaata 
taagaaacca 
cgtc 



gacacgactt 
taggcggtgc 
tatttggtat 
gatccggcaa 
cgcgcagaaa 
agtggaacga 
cctagatcct 
cttggtctga 
ttcgttcatc 
taccatctgg 
tatcagcaat 
ccgcctccat 
atagtttgcg 
gtatggcttc 
tgtgcaaaaa 
cagtgttatc 
taagatgctt 
ggcgaccgag 
ctttaaaagt 
cgctgttgag 
ttactttcac 
gaataagggc 
gcatttatca 
aacaaatagg 
ttattatcat 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3584 



<210> 55 

<211> 4507 

<212> DNA 

<213> artificial sequence 



<220> 
<221> 
<223> 



misc__f eature 

Sequence represents a plant promoter-terminator expression cassette 
in vector pUC19 



<400> 55 



tcgcgcgttt 
cagcttgtct 
ttggcgggtg 
accatatgcg 
attcgccatt 
tacgccagct 
tttcccagtc 
gcaaatttac 
gtgtgttatg 
tttatgctaa 
tttttgtctt 
tactatagga 
gcaatgctgc 
taccacacaa 
agacaacaat 
ttaaaaatat 
ggaggatgca 



cggtgatgac 
gtaagcggat 
tcggggctgg 
gtgtgaaata 
caggctgcgc 
ggcgaaaggg 
acgacgttgt 
acattgccac 
tatttgattt 
cgtttgccaa 
ctaaatacat 
gaattaaagt 
atggatggca 
gatttgaggt 
gttaccacac 
tttggaaatg 
ataatgaaga 



ggtgaaaacc 
gccgggagca 
cttaactatg 
ccgcacagat 
aactgttggg 
ggatgtgctg 
aaaacgacgg 
taaacgtcta 
gcgataaatt 
cacttagcaa 
atactaatca 
gagtgaatat 
tatacaccaa 
gcatgaacgt 
acaagttttg 
atttgcatgg 
aaactacaaa 



tctgacacat 

gacaagcccg 
cggcatcaga 
gcgtaaggag 
aagggcgatc 
caaggcgatt 
ccagtgaatt 
aacccttgta 
tttatatttg 
tttgcaagtt 
actggaaatg 
ggtaccacaa 
acattcaata 
cacgtggaca 
aggtgcatgc 
aagccatgtg 
tttacatgca 



gcagctcccg 
tcagggcgcg 
gcagattgta 
aaaataccgc 
ggtgcgggcc 
aagttgggta 
cggcgcgccg 
atttgttttt 
gtactaaatt 
gattaattga 
taaatatttg 
ggtttggaga 
attcttgagg 
aaaggtttag 
atggatgccc 
taaaaccatg 
actagttatg 



gagacggtca 

tcagcgggtg 
ctgagagtgc 
atcaggcgcc 
tcttcgctat 
acgccagggt 
agctcctcga 
gttttactat 
tataacacct 
ttctaaatta 
ctaatatttc 
tttaattgtt 
ataataatgg 
taatttttca 
tgtggaaagt 
acatccactt 
catgtagtct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 1080 

taatttcttc atagccagcc caccgcggtg ggcggccgcc tgcagtctag aaggcctcct 1140 

gctttaatga gatatgcgag acgcctatga tcgcatgata tttgctttca attctgttgt 1200 

gcacgttgta aaaaacctga gcatgtgtag ctcagatcct taccgccggt ttcggttcat 1260 

tctaatgaat atatcacccg ttactatcgt atttttatga ataatattct ccgttcaatt 1320 

tactgattgt ccgtcgagca aatttacaca ttgccactaa acgtctaaac ccttgtaatt 1380 

tgtttttgtt ttactatgtg tgttatgtat ttgatttgcg ataaattttt atatttggta 1440 

ctaaatttat aacacctttt atgctaacgt ttgccaacac ttagcaattt gcaagttgat 1500 

taattgattc taaattattt ttgtcttcta aatacatata ctaatcaact ggaaatgtaa 1560 

atatttgcta atatttctac tataggagaa ttaaagtgag tgaatatggt accacaaggt 1620 

ttggagattt aattgttgca atgctgcatg gatggcatat acaccaaaca ttcaataatt 1680 

cttgaggata ataatggtac cacacaagat ttgaggtgca tgaacgtcac gtggacaaaa 1740 

ggtttagtaa tttttcaaga caacaatgtt accacacaca agttttgagg tgcatgcatg 1800 

gatgccctgt ggaaagttta aaaatatttt ggaaatgatt tgcatggaag ccatgtgtaa 18 60 

aaccatgaca tccacttgga ggatgcaata atgaagaaaa ctacaaattt acatgcaact 1920 

agttatgcat gtagtctata taatgaggat tttgcaatac tttcattcat acacactcac 1980 

taagttttac acgattataa tttcttcata gccagcggat ccgatatcgg gcccgctagc 2040 

gttaaccctg ctttaatgag atatgcgaga cgcctatgat cgcatgatat ttgctttcaa 2100 

ttctgttgtg cacgttgtaa aaaacctgag catgtgtagc tcagatcctt accgccggtt 2160 

tcggttcatt ctaatgaata tatcacccgt tactatcgta tttttatgaa taatattctc 2220 

cgttcaattt actgattgtc cgtcgacgaa ttcgagctcg gcgcgccaag cttggcgtaa 2280 

tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc acacaacata 234 0 

cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagtgagcta actcacatta 2400 

attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa 24 60 

tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg 2520 

ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag 2580 

gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa 2640 

ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc 2700 

cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca 2760 

ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg 2820 

accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct 2880 

catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt 2940 

gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag 3000 

tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc 3060 

agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac 3120 

actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga 3180 

gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc 3240 

aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg 3300 

gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca 3360 

aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt 3420 

atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca 3480 

gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta gataactacg 3540 

atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga cccacgctca 3600 

ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg cagaagtggt 3660 

cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc tagagtaagt 3720 

agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca 3780 

cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag gcgagttaca 3840 

tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat cgttgtcaga 3900 

agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa ttctcttact 3960 

gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa gtcattctga 4020 

gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga taataccgcg 4080 

ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg gcgaaaactc 4140 

tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc acccaactga 4200 

tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg aaggcaaaat 4260 

gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact cttccttttt 4320 

caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat atttgaatgt 4380 

atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt gccacctgac 4440 
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gtctaagaaa ccattattat catgacatta acctataaaa ataggcgtat cacgaggccc 4 500 
tttcgtc 4507 

<210> 56 

<211> 17752 

<212> DNA 

<213> Phaeodactylum tricornutum, Physcomitrella patens 
<220> 

<221> CDS 

<222> (11543) . . (12415) 

<223> Delta- 6-elongase 

<220> 

<221> CDS 

<222> (13313) . . (14890) 

<223> Delta- 6-ciesaturase 

<220> 

<221> CDS 

<222> (15791) (17200) 

<223> Delta-5-desaturase 

<400> 56 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 

gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 

tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 

ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240 

atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 

ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 

tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 4 20 

gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 4 80 

tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 

cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 

ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 

ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 

gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 

ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 

ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900 

ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 

ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 

tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 

ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 

ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 

gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260 

ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 

aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380 

ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 

aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 

gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 

ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620 

tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680 

ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 

ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 

ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 
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agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 

aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 

aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 204 0 

tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 

tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 

gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 

aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 

gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 

gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 

cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 24 60 

cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 

cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 

gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 264 0 

gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 

tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 

gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 

ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 

aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 294 0 

tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 

tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 

ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 

ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 

gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 

cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 

tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 

gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420 

ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 

tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 

ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 

tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 

attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 

tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 

gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840 

gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 

cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 

atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 

gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 

tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 

cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 

catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 

ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4 320 

acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 

agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 

tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 

aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 

cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4 620 

taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 

gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 

aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 

gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 

gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920 

gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 

aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 

ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 

gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 

cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 
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gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 

ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 

ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 54 60 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 

gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 

cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 

ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 

cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 

cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 

gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 

gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 

ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 

aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 

cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 

ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 

ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 

ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 

cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360 

cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 

ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480 

cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 

ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 

gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 

ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 

aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 

cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 68 4 0 

tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 

agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 

tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 

ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 

cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140 

cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200 

ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260 

aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320 

ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380 

cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440 

tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500 

acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gctaggtagc 7560 

ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7620 

gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7680 

gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 

acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800 

tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860 

ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 

acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980 

catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040 

aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 

cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 

cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 

tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 

tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340 

gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 

gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 84 60 

tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg- 8520 

gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 

tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 

gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 
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agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 8760 

gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 

tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 

ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940 

ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 

gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 

gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 

aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 

ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 

tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300 

tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 

agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 

tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 

tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 954 0 

ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 

tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 

agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720 

gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780 

gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 

gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 

tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960 

agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020 

ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 

ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 

cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 

tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260 

agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 

acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 

tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 

gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 

gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 

cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 

gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 

ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740 

actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 

gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 

attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 

tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 

tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 

aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 

attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160 

ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220 

tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 

taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 

tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 

aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460 

ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 

agccagccca ccgcggtgga aa atg gag gtc gtg gag aga ttc tac ggt gag 11572 

Met Glu Val Val Glu Arg Phe Tyr Gly Glu 
15 10 

ttg gat ggg aag gtc teg cag ggc gtg aat gca ttg ctg ggt agt ttt 11620 
Leu Asp Gly Lys Val Ser Gin Gly Val Asn Ala Leu Leu Gly Ser Phe 

15 20 25 

ggg gtg gag ttg acg gat acg ccc act acc aaa ggc ttg ccc etc gtt 11668 
Gly Val Glu Leu Thr Asp Thr Pro Thr Thr Lys Gly Leu Pro Leu Val 

30 35 40 

gac agt ccc aca ccc ate gtc etc ggt gtt tet gta tac ttg act att 11716 
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Asp 


Ser 


Pro 
45 


Thr 


Pro 


He 


Val 


Leu 
50 


Glv 


Val 


Ser 


Val 


Tvr 
55 


Leu 


Thr 


He 




qtc 


att 


qoa 




ett 


ttq 


tqq 


ata 


aaq 


qee 


aqq 


qat 


etq 


aaa 


ecq 


eqe 


11764 


Val 


He 
60 


Gly 


Gly 


Leu 


Leu 


Trp 
65 


He 


Lys 


Ala 


Arg 


Asp 
70 


Leu 


Lys 


Pro 


Arg 




occ 


tCQ 


Qaq 


cea 


ttt 


ttq 


etc 


caa 


qct 


ttq 


qtq 


ctt 


qtq 


cae 


aac 


etq 


11812 


Ala 


Ser 


Glu 


Pro 


Phe 


Leu 


Leu 


Gin 


Ala 


Leu 


Val 


Leu 


Val 


His 


Asn 


Leu 




75 










80 










85 










90 




ttc 


tgt 


ttt 


QCQ 
^ ^ ^ 


etc 


aqt 


etq 


tat 


atg 


tqc 


qtq 


qqe 


ate 


qct 


tat 


caq 


11860 


Phe 


Cys 


Phe 


Ala 


Leu 
95 


Ser 


Leu 


Tyr 


Met 


Cys 
100 


Val 


Gly 


He 


Ala 


Tyr 
105 


Gin 




act 


att 


acc 


tqq 


cqq 


tac 


tet 


etc 


tqq 


qqe 


aat 


qca 


tac 


aat 


ect 


aaa 


11908 


Ala 


He 


Thr 


Tro 
110 


Arg 


Tvr 


Ser 


Leu 


Tro 
115 


Glv 


Asn 


Ala 


Tvr 


Asn 
120 


Pro 


Lys 




cat 


aaa 


aaa 


atg 


aca 


att 


ctg 


gta 


tac 


tta 

u uy 


ttc 


tac 


atg 


tet 


aag 


tae 


11956 


His 


Lys 


Glu 

125 


Met 


Ala 


He 


Leu 


Val 

130 


Tvr 


Leu 


Phe 


Tvr 


Met 

135 


Ser 


Lys 


Tyr 




QtQ 


aaa 


ttc 


atg 


aat 


ace 


qtt 


ate 


ata 


ata 


eta 


aag 


cae 


age 


ace 


agg 


12004 


Val 


Glu 


Phe 


Met 


Asp 


Thr 


Val 


He 


Met 


He 


Leu 


Lys 


Arg 


Ser 


Thr Arg 






140 










145 










150 












caa 


ata 


age 


ttc 


etc 


cae 


att 


tat 


eat 


eat 


tet 


tea 


att 


tec 


etc 


att 


12052 


Gin 


He 


Ser 


Phe 


Leu 


His 


Val 


Tyr 


His 


His 


Ser 


Ser 


He 


Ser 


Leu 


He 




155 










160 










165 










170 




taa 


taa 


act 


att 


act 


eat 


cae 


act 


ect 


aac 


aat 


gaa 


aca 


tat 


tgg 


tet 


12100 


Trp 


Trp 


Ala 


He 


Ala 
175 


His 


His 


Ala 


Pro 


Gly 
180 


Gly 


Glu 


Ala 


Tyr 


Trp 
185 


Ser 




Qca 


get 


ctg 


aac 


tea 


aaa 
y y " 


ata 
y ^y 


eat 


att 


etc 


atg 


tat 


oca 
y ^y 


tat 


tae 


ttc 


12148 


Ala 


Ala 


Leu 


Asn 
190 


Ser 


Gly 


Val 


His 


Val 
195 


Leu 


Met 


Tyr 


Ala 


Tyr 
200 


Tyr 


Phe 




tta 


get 


gcc 


tge 


ctt 


ega 


agt 


age 


eca 


aag 


tta 


aaa 


aat 


aag 


tae 


ctt 


12196 


Leu 


Ala 


Ala 
205 


Cys 


Leu 


Arg 


Ser 


Ser 
210 


Pro 


Lys 


Leu 


Lys 


Asn 
215 


Lys 


Tyr 


Leu 




ttt 


taa 
^ y y 


aac 
y y ^ 


aaa 


tac 


tta 


aca 


caa 


ttc 


caa 


atg 


ttc 


cag 


ttt 


atg 


ctg 


12244 


Phe 


Trp 
220 


Gly 


Arg 


Tyr 


Leu 


Thr 
225 


Gin 


Phe 


Gin 


Met 


Phe 
230 


Gin 


Phe 


Met 


Leu 




aac 


tta 


ata 


cag 


get 


tac 


tac 


gae 


atg 


aaa 


acg 


aat 


aca 


cea 


tat 


cea 


12292 


Asn 


Leu 


Val 


Gin 


Ala 


Tvr 


Tvr 


ASD 


Met 


Lvs 


Thr 


Asn 


Ala 


Pro 


Tyr 


Pro 




235 










240 










245 










250 




caa 


taq 


eta 


ate 


aag 


att 


ttq 
^ ^ ^ 


ttc 


tac 


tac 


atg 


ate 


teg 


ttg 


ctg 


ttt 


12340 


Gin 


Trp 


Leu 


He 


Lys 
? ^ S 


He 


Leu 


Phe 


Tyr 


Tyr 
260 


Met 


He 


Ser 


Leu 


Leu 
265 


Phe 




ctt 


ttc 


ggc 


aat 


ttt 


tac 


gta 


caa 


aaa 


tac 


ate 


aaa 


cec 


tet 


gae 


gga 


12388 


Leu 


Phe 


Gly 


Asn 


Phe 


Tyr 


Val 


Gin 


Lys 


Tyr 


He 


Lys 


Pro 


Ser 


Asp Gly 










270 










275 










280 








aag 


caa 


aag 


gga 


get 


aaa 


act 


gag 


tga 


tetagaagge eteetgettt 




12435 


Lys 


Gin 


Lys 

285 


Gly 


Ala 


Lys 


Thr 


Glu 

290 





















aatgagatat gegagaegee tatgategea tgatatttge ttteaattet gttgtgeaeg 12495 

ttgtaaaaaa ectgageatg tgtagetcag atecttaceg ccggtttegg ttcattctaa 12555 

tgaatatate aceegttaet atcgtatttt tatgaataat attcteegtt eaatttactg 12615 

attgteegte gageaaattt aeacattgee aetaaacgte taaaeeettg taatttgttt 12675 

ttgttttaet atgtgtgtta tgtatttgat ttgegataaa tttttatatt tggtaetaaa 12735 

tttataacae ettttatget aacgtttgee aacacttage aatttgeaag ttgattaatt 12795 

gattetaaat tatttttgte ttetaaatac atataetaat eaaetggaaa tgtaaatatt 12855 

tgctaatatt tetactatag gagaattaaa gtgagtgaat atggtaceac aaggtttgga 12915 

gatttaattg ttgeaatget gcatggatgg catatacace aaacatteaa taattcttga 12975 

ggataataat ggtaeeacae aagatttgag gtgeatgaae gtcaegtgga eaaaaggttt 13035 
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agtaattttt caagacaaca atgttaccac acacaagttt tgaggtgcat gcatggatgc 13095 

cctgtggaaa gtttaaaaat attttggaaa tgatttgcat ggaagccatg tgtaaaacca 13155 

tgacatccac ttggaggatg caataatgaa gaaaactaca aatttacatg caactagtta 13215 

tgcatgtagt ctatataatg aggattttgc aatactttca ttcatacaca ctcactaagt 13275 

tttacacgat tataatttct tcatagccag cggatcc atg gta ttc gcg ggc ggt 13330 

Met Val Phe Ala Gly Gly 
295 



gga 


ctt 


cag 


cag 


y y u 


tct 


etc 


gaa 


gaa 


aac 


ate 


gac 


gte 


aa a 

ydy 


cac 


att 


13378 


Gly Leu 


Gin 


Gin 


Gly 


Ser 


Leu 


Glu 


Glu 


Asn 


He 


Asp 


Val 


Glu 


His 


He 










300 










305 










310 








gcc 


agt 


atg 


tct 


etc 


ttc 


age 


gac 


ttc 


ttc 


agt 


tat 


ata 

y L.y 


tct 


tea 


act 


13426 


Ala 




Met 


Ser 


Leu 


Phe 


Ser 


Asp 


Phe 


Phe 


Ser 


Tvy 
■L yj- 


Val 


Ser 


Ser 


Thr 








315 










320 










325 










att 


y y L. 


L. ^y 




d y O 


y L. d 


c ^ c 
<jL <^ 


dy u 


at a 


e a a 


ee t 


ttg 


aag 


cgc 


ctg 


aeg 


13474 


Val 


Gly 




± xp 


C X 


Val 


His 


Ser 


He 


Gin 


Pro 


Leu 


Lys 


Arg 


Leu 


Thr 






^ ~j \j 










335 










340 












agt 


aag 


aag 


eg t 


y t. L 


teg 


gaa 


age 


get 


gee 


rrt rr 
y L-y 


eaa 


tat 

uy L. 


ata 


tea 


get 


13522 




j-i _y o 


T .\7Q 

j-iy o 


r^x y 


V d X 


C (3 "T 
OCX 


VJX Lt 


OCX 


Al ^ 


Al Pi 
rA.x d 


1 

V d X 


r;i n 

ox 11 


wy o 


Tip 

X X c 


Ser 


Ala 














350 










355 










360 




gaa 


at t 


r*^ rr 
o d y 


dy d 


dd L, 


L. y 


d y u 


d ^ 


v^d y 


rrrr;=» 


act 


rrr" rf 
y t^y 


rr;^ rr 
ydy 


gea 


etc 


gea 


13570 


VjX Li 


V d JL 




A "rrr 
r\l. y 


A c;n 


OCX 


OCX 


Thy 

X 1 IX 


V7X 1 1 


\D J. y 


Thy 
X 1 IX 


Al 


V3X LX 


Al 3 
r^x d 


T .^n 

XjC h 


Ala 












365 










370 










375 






y Cl a. 


tea 


g L c 


gtg 


aag 


e ee 


d L-y 


^ Pi 
dy d 


i^y d 


Pi rrrr 

agg 




t rt 


r* a rr 
^ dy 


t rrrr 


Pi PiCt 

d dy 


d d y 


X O U X o 


v3 -L Li 


Q Pi -K- 
OCX. 


V d X 


V dx 


Lys 


py r> 


Th y 

J. liX 


A y rr 

rA.X y 


A y rr 


A y rr 
r\l. y 


y 

OCX 


y 

OCX 


f;i n 


Tyo 


xjy o 


x_ty o 










380 










385 










390 








t eg 


aca 


Cl O 




rt ^ 


L.^d 


y d d 


y Ld 


y v^d 


y L. d 


^ c 


A a o 
dd^ 


Pi Pin 
dd y 


cea 


age 


gat 


13666 


Ser 


Thr 






Leu 


C Q -y- 

OCX 


wX LI 


V dx 


Al ^ 
raXd 


V d X 


rix o 


A «5n 


Xjy o 


Pyri 


OCX 


A <5y» 








395 










400 










4 05 










t gc 


uyy 


att 


y L L 


gta 


aaa 


aac 


aag 


y uy 


tat 


gat 


att 
y u u 


tee 


aat 


ttt 




13714 




i up 




V d X 


V dX 


Xiy o 


A 


Xiy o 


V dx 


Tvy 
i yr 


A <?yi 


1 

V d X 


C r* 

OCX 


Asn 


Phe 


Ala 






410 










415 










420 












gac 


era Q 

y ay 


cat 


e ee 


yy d 


yy d 


tea 


y L L 


att 


agt 


act 


tat 


ttt 


y y d 


ega 


gac 


13762 




r;i 11 


His 


£^ X V^' 


\3 A. y 


vj -L _y 


OCX 


Val 


lie 


Ser 


Thr 


T\/y 
X yx 


Phe 


Gly 


Arg 


Asp 




425 










430 










435 










440 






aca 


y d u 


y L L 






dy u 


U ^ I.. 


cat 


gea 


get 


tct 


aca 


uyy 


aaa 


att 


13810 


Gly 


Thr 


Asp 


Val 


Phe 


Ser 


Ser 


Phe 


His 


Ala 


Ala 


Ser 


Thr 


Tm 

X x^ 


Lys 


He 












445 










450 










455 






ctt 


caa 


gac 


ttt 


tac 


att 


yy L, 


gac 


rrt rr 
y L_y 


rr n rr 
ydy 


a rrrr 
dyy 


rrt a 
y uy 


rr 

ydy 


ccg 


act 


cea 


13858 


Leu 


Gin 


Asp 


Phe 


T vr 
J. y X 


lie 


Gly 


Asp 


Val 


Glu 


Arg 


Val 


Glu 


Pro 


Thr 


Pro 










4 60 










465 










470 








craa 


eta 


ctg 


aaa 


gat 


ttc 


ega 


gaa 


atg 


aga 


get 


ctt 


ttc 


ctg 


aaa 

ay y 


aaa 

y ay 


13906 


Glu 


Leu 


Leu 


Lys 


Asp 


Phe 


Arg 


Glu 


Met 


Arg 


Ala 


Leu 


Phe 


Leu 


Arg 


Glu 








475 










480 










485 










caa 


ctt 


ttc 


aaa 


aqt 


tea 


aaa 


ttg 


tac 


tat 


qtt 


ata 


aag 


ctg 


etc 


aeg 


13954 


Gin 


Leu 


Phe 


Lys 


Ser 


Ser 


Lys 


Leu 


Tyr 


Tyr 


Val 


Met 


Lys 


Leu 


Leu 


Thr 






490 










495 










500 












aat 


gtt 


get 


att 


ttt 


get 


gcg 


age 


att 


gea 


ata 


ata 


tgt 


tgg 


age 


aag 


14002 


Asn 


Val 


Ala 


He 


Phe 


Ala 


Ala 


Ser 


He 


Ala 


He 


He 


Cys 


Trp 


Ser 


Lys 




505 










510 










515 










520 




act 


att 


tea 


gcg 


gtt 


ttg 


get 


tea 


get 


tgt 


atg 


atg 


get 


ctg 


tgt 


ttc 


14050 


Thr 


He 


Ser 


Ala 


Val 


Leu 


Ala 


Ser 


Ala 


Cys 


Met 


Met 


Ala 


Leu 


Cys 


Phe 












525 










530 










535 






caa 


cag 


tgc 


gga 


tgg 


eta 


tec 


eat 


gat 


ttt 


etc 


cac 


aat 


cag 


gtg 


ttt 


14098 


Gin 


Gin 


Cys 


Gly 


Trp 


Leu 


Ser 


His 


Asp 


Phe 


Leu 


His 


Asn 


Gin 


Val 


Phe 










540 










545 










550 








gag 


aca 


cgc 


tgg 


ctt 


aat 


gaa 


gtt 


gte 


ggg 


tat 


gtg 


ate 


ggc 


aac 


gcc 


14146 


Glu 


Thr 


Arg 


Trp 


Leu 


Asn 


Glu 


Val 


Val 


Gly 


Tyr 


Val 


He 


Gly 


Asn 


Ala 
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555 560 565 



gtt 


ctg 




ttt 


aqt 


aca 


QQQ 

ZJ ZJ ZJ 


tqq 


tqq 


aag 


qaq 


aag 


eat 


aac 


ctt 


cat 


14194 


Val 


Leu 


Glv 


Phe 


Ser 


Thr 


Gly 


Trp 


Trp 


Lvs 


Glu 


Lvs 


His 


Asn 


Leu 


His 






570 










575 










580 












cat 


act 


act 


cca 


aat 


gaa 


tac 


qat 




act 


tac 


caa 


cca 


att 


qat 


qaa 


14242 


His 


Ala 


Ala 


Pro 


Asn 


Glu 


Cvs 


Asp 


Gin 


Thr 


Tyr 


Gin 


Pro 


He 


Asp 


Glu 




585 










590 










595 










600 




qat 


att 


aat 


act 


etc 


ccc 


etc 


att 


qee 


tqq 


age 


aag 


qac 


ata 


ctg 


gee 


14290 


Asp 


He 


Asp 


Thr 


Leu 


Pro 


Leu 


He 


Ala 


Trp 


Ser 


Lys 


Asp 


He 


Leu 


Ala 












605 










610 










615 






aca 


gtt 


aaa 


aat 


aaa 

**** 3 


aca 


ttc 


ttg 


eqa 


ate 


etc 


caa 


tac 


caq 


eat 


ctg 


14338 


Thr 


Val 


Glu 


Asn 


Lvs 


Thr 


Phe 


Leu 


Arq 


He 


Leu 


Gin 


Tyr 


Gin 


His 


Leu 










620 










625 










630 








ttc 


ttc 


atg 


aat 
y y 


ctg 


tta 


ttt 


ttc 


gee 


cgt 


ggt 


agt 


tqq 


etc 


ttt 


tqq 


14386 


Phe 


Phe 


Met 


Glv 


Leu 


Leu 


Phe 


Phe 


Ala 


Arg 


Gly Ser 


Trp 


Leu 


Phe 


Trp 








635 










640 










645 










age 


tea 


aga 


tat 


acc 


tct 


aca 


gca 


at a 


etc 


tea 


cet 


qt c 


qac 


aqq 


ttq 


14434 


Ser 


TrD 


Arg 


Tvr 


Thr 


Ser 


Thr 


Ala 


Val 


Leu 






Val 


Asp 


Arg 


Leu 






650 










655 










660 












tta 


y ay 


aag 




act 


att 

y u u 


ctg 


ttt 


cac 


tac 


ttt 


taa 

»-y y 


ttc 


gtc 


000 
y y y 


aca 


14482 


Leu 


Glu 


Lys 


Gly 


Thr 


Val 


Leu 


Phe 


His 


Tvr 


Phe 


TrD 


Phe 


Val 


Glv 


Thr 




665 










670 










675 










680 






t gc 


tat 


ctt 


etc 


cet 


aat 

y y u 


t aa 

L-y y 


aag 


cea 


tta 


gta 


taa 

L.y y 


atg 


aca 
y ^y 


ata 
y ^y 


14530 


Ala 


Cys 


Tvr 


Leu 


Leu 


Pro 


Gly 


TrD 


Lys 


Pro 


Leu 


Val 


TrD 


Met 


Ala 


Val 












685 










690 










695 






act 


oacr 
y 


etc 


atg 


tec 


aac 
yy ^ 


atg 


ctg 


ctg 


aac 
y *3 ^ 


ttt 


gta 


ttt 


qta 


ctt 


aac 


14578 


Thr 


Glu 


Leu 


Met 


Ser 


Glv 


Met 


Leu 


Leu 


Glv 


Phe 


Val 


Phe 


Val 


Leu 


Ser 










700 










705 










710 








cac 


aat 


aacr 
y y y 


atg 


aaa 

yay 


att 

y u u 


tat 


aat 


teg 


tct 


aaa 


gaa 


ttc 


ata 

y uy 


agt 


gca 


14626 


His 


Asn 


Gly 


Met 


Glu 


Val 


Tvr 


Asn 


Ser 


Ser 


Lys 


Glu 


Phe 


Val 


Ser 


Ala 








715 










720 










725 










cag 


ate 


gt a 


tec 


aca 


coo 

v^y y 


gat 


ate 


aaa 


aaa 

y y 0 


aac 


ata 


ttc 


aac 


gac 


taa 

uy y 


14674 


Gin 


He 


Val 


Ser 


Thr 


Arg 


Asp 


He 


Lys 


Glv 


Asn 


He 


Phe 


Asn 


Asp 


TrD 






730 










735 










740 












ttc 


act 


aat 


aac 
^y ^ 


ctt 


aac 


aaa 


caa 


ata 


aaa 


cat 


eat 


ctt 


ttc 


cca 


aca 


14722 


Phe 


Thr 


Glv 


Glv 


Leu 


Asn 


Arg 


Gin 


He 


Glu 


His 


His 


Leu 


Phe 


Pro 


Thr 




745 










750 










755 










760 




atg 


ccc 


aaa 
"y y 


cat 


aat 


tta 


aac 


aaa 


ata 


gca 


cet 


aga 


ata 
y *-y 


aaa 

y ay 


ata 
y "-y 


ttc 


14770 


Met 


Pro 


Arg 


His 


Asn 


Leu 


Asn 


Lys 


He 


Ala 


Pro 


Arg 


Val 


Glu 


Val 


Phe 












765 










770 










775 






tat 
^y »- 


aag 


aaa 


cac 


yy ^ 


ctg 


(Tt* CI 

y uy 


tac 


gaa 


gae 


gta 


tct 


att 


get 


ace 


ggc 


14818 


Cys 


Lys 


Lys 


His 


Gly 


Leu 


Val 


Tyr 


Glu 


Asp 


Val 


Ser 


He 


Ala 


Thr 


Gly 










780 










785 










790 








act 


tgc 


aag 


gtt 


ttg 


aaa 


gca 


ttg 


aag 


gaa 


gtc 


gcg 


gag 


get 


gcg 


gca 


14866 


Thr 


Cys 


Lys 


Val 


Leu 


Lys 


Ala 


Leu 


Lys 


Glu 


Val 


Ala 


Glu 


Ala 


Ala 


Ala 








795 










800 










805 










gag 


cag 


cat 


get 


acc 


acc 


agt 


taa 


getagegtta accctgettt aatgagatat 


14920 


Glu 


Gin 


His 


Ala 


Thr 


Thr 


Ser 























810 815 

gegagacgcc tatgatcgca tgatatttge tttcaattet gttgtgcacg ttgtaaaaaa 14980 

ectgagcatg tgtagctcag atecttaccg ecggtttegg tteattctaa tgaatatate 15040 

accegttact ategtatttt tatgaataat attctccgtt caatttactg attgtccgtc 15100 

gageaaattt acacattgcc actaaaegtc taaaeeettg taatttgttt ttgttttaet 15160 

atgtgtgtta tgtatttgat ttgcgataaa tttttatatt tggtaetaaa tttataacae 15220 

ettttatgct aacgtttgcc aacacttage aatttgcaag ttgattaatt gattctaaat 15280 

tatttttgtc ttctaaatae atatactaat caactggaaa tgtaaatatt tgctaatatt 15340 

tetactatag gagaattaaa gtgagtgaat atggtaecae aaggtttgga gatttaattg 15400 
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ttgcaatgct gcatggatgg catatacacc aaacattcaa taattcttga ggataataat 
ggtaccacac aagatttgag gtgcatgaac gtcacgtgga caaaaggttt agtaattttt 
caagacaaca atgttaccac acacaagttt tgaggtgcat gcatggatgc cctgtggaaa 
gtttaaaaat attttggaaa tgatttgcat ggaagccatg tgtaaaacca tgacatccac 
ttggaggatg caataatgaa gaaaactaca aatttacatg caactagtta tgcatgtagt 
ctatataatg aggattttgc aatactttca ttcatacaca ctcactaagt tttacacgat 
tataatttct tcatagccag cagatctaaa atg get ccg gat gcg gat aag ctt 

Met Ala Pro Asp Ala Asp Lys Leu 
820 

cga caa cgc cag acg act gcg gta gcg aag cac aat get get ace ata 
Arg Gin Arg Gin Thr Thr Ala Val Ala Lys His Asn Ala Ala Thr lie 

825 830 835 

teg acg cag gaa cgc ctt tgc agt ctg tct teg etc aaa gge gaa gaa 
Ser Thr Gin Glu Arg Leu Cys Ser Leu Ser Ser Leu Lys Gly Glu Glu 
840 845 850 855 

gtc tgc ate gac gga ate ate tat gae etc eaa tea ttc gat eat ecc 
Val Cys lie Asp Gly lie lie Tyr Asp Leu Gin Ser Phe Asp His Pro 

860 865 870 

ggg ggt gaa acg ate aaa atg ttt ggt gge aae gat gtc act gta cag 
Gly Gly Glu Thr He Lys Met Phe Gly Gly Asn Asp Val Thr Val Gin 

875 880 885 

tac aag atg att eae ccg tac eat ace gag aag cat ttg gaa aag atg 
Tyr Lys Met He His Pro Tyr His Thr Glu Lys His Leu Glu Lys Met 

890 895 900 

aag cgt gtc gge aag gtg acg gat ttc gtc tgc gag tac aag ttc gat 
Lys Arg Val Gly Lys Val Thr Asp Phe Val Cys Glu Tyr Lys Phe Asp 

905 910 915 

acc gaa ttt gaa cgc gaa ate aaa cga gaa gtc ttc aag att gtg cga 
Thr Glu Phe Glu Arg Glu He Lys Arg Glu Val Phe Lys He Val Arg 
920 925 930 935 

cga gge aag gat ttc ggt act ttg gga tgg ttc ttc cgt gcg ttt tgc 
Arg Gly Lys Asp Phe Gly Thr Leu Gly Trp Phe Phe Arg Ala Phe Cys 

940 945 950 

tac att gee att ttc ttc tac ctg eag tac cat tgg gtc acc acg gga 
Tyr He Ala He Phe Phe Tyr Leu Gin Tyr His Trp Val Thr Thr Gly 

955 960 965 

acc tct tgg ctg ctg gee gtg gee tac gga ate tec caa gcg atg att 
Thr Ser Trp Leu Leu Ala Val Ala Tyr Gly He Ser Gin Ala Met He 

970 975 980 

gge atg aat gtc cag cac gat gee aae eae ggg gee ace tec aag cgt 
Gly Met Asn Val Gin His Asp Ala Asn His Gly Ala Thr Ser Lys Arg 

985 990 995 

ecc tgg gtc aae gae atg eta gge etc ggt gcg gat ttt att ggt 



Pro Trp Val Asn Asp Met 
1000 1005 
ggt tee aag tgg etc tgg 
Gly Ser Lys Trp Leu Trp 
1015 1020 
tac acc aat cac gee gag 
Tyr Thr Asn His Ala Glu 
1030 1035 
cca atg etc eta ttc aae 
Pro Met Leu Leu Phe Asn 
1045 1050 
acc tgg eta eat cgc ttt 
Thr Trp Leu His Arg Phe 
1060 1065 



Leu Gly Leu Gly Ala 
1010 

eag gaa eaa eae tgg 
Gin Glu Gin His Trp 
1025 

atg gat ecc gat age 
Met Asp Pro Asp Ser 
1040 

gac tat ecc ttg gat 
Asp Tyr Pro Leu Asp 
1055 

caa gca ttc ttt tac 
Gin Ala Phe Phe Tyr 
1070 



Asp Phe He Gly 

ace eae eae get 
Thr His His Ala 

ttt ggt gee gaa 
Phe Gly Ala Glu 

cat ecc get cgt 
His Pro Ala Arg 

atg ecc gtc ttg 
Met Pro Val Leu 



15460 
15520 
15580 
15640 
15700 
15760 
15814 



15862 



15910 



15958 



16006 



16054 



16102 



16150 



16198 



16246 



16294 



16342 



16387 



16432 



16477 



16522 



16567 
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get 


aaa 

y y a 


t ac 


taa 

v,y y 


tta 

i_ s_y 


tec 


get 


gtc 


ttc 


aat 


cea 


caa 


att 


ett 


gae 


16612 


Ala 


Glv 


T vr 


TrD 


Leu 


Ser 


Ala 


Val 


Phe 


Asn 


Pro 


Gin 


lie 


Leu 


Asp 




1075 










1080 










1085 












etc 


cag 


caa 


cge 


aac 
y y ^ 


gea 


ett 


tec 


ate 


aat 


ate 


cgt 


etc 


aac 


aac 


16657 


LiGU. 


Gin 


Gin 


Arg 


Gly 


Ala 


Leu 


Ser 


Val 


Gly 


lie 


Arg 


Leu 


Asp 


Asn 




1090 










1095 










1100 












get 


ttc 


att 


cac 


teg 


ega 


cge 
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Ser 


Ala 


Asp 


Arg 


Asp 


Pro 


Thr 


Ala 


Pro 


Leu 


Lys 


Lys 


Thr 


Gly 


Glu 



78 







355 










360 






365 








Pro 


Val 


Asp 


Trp 


Phe 


Lys 


Thr 


Gin 


Val Glu 


Thr Ser 


Cys 


Thr 


Tyr 


Gly 




370 










375 






380 










Gly 


Phe 


Leu 


Ser 


Gly 


Cys 


Phe 


Thr 


Gly Gly 


Leu Asn 


Phe 


Gin 


Val 


Glu 


385 










390 








395 








400 


His 


His 


Leu 


Phe 


Pro 


Arg 


Met 


Ser 


Ser Ala 


Trp Tyr 


Pro 


Tyr 


He 


Ala 










405 








410 








415 




Pro 


Lys 


Val 


Arg 


Glu 


He 


Cys 


Ala 


Lys His 


Gly Val 


His 


Tyr Ala 


Tyr 








420 










425 






430 






Tyr 


Pro 


Trp 


He 


His 


Gin 


Asn 


Phe 


Leu Ser 


Thr Val 


Arg 


Tyr 


Met 


His 






435 










440 






445 








Ala 


Ala 


Gly 


Thr 


Gly 


Ala 


Asn 


Trp Arg Gin Met Ala 


Arg 


Glu 


Asn 


Pro 




450 










455 






460 










Leu 


Thr 


Gly 


Arg 


Ala 





















465 

<210> 60 

<211> 26 

<212> DNA 
<213> unknown 

<220> 

<223> primer 
<220> 

<221> misc_feature 

<222> (1) . . (26) 

<223> primer 

<400> 60 

gaattcggcg cgccgagctc ctcgag 26 

<210> 61 

<211> 265 
<212> DNA 
<213> unknown 

<220> 

<223> primer 
<220> 

<221> misc_feature 

<222> (1)..(265) 

<223> primer 

<400> 61 

ccaccgcggt gggcggccgc ctgcagtcta gaaggcctcc tgctttaatg agatatgcga 60 

gacgcctatg atcgcatgat atttgctttc aattctgttg tgcacgttgt aaaaaacctg 120 

agcatgtgta gctcagatcc ttaccgccgg tttcggttca ttctaatgaa tatatcaccc 180 

gttactatcg tatttttatg aataatattc tccgttcaat ttactgattg tccgtcgacg 240 

aattcgagct cggcgcgcca agctt 265 



<210> 62 
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<211> 257 
<212> DNA 
<213> unknown 

<220> 

<223> primer 
<220> 

<221> misc_f eature 

<222> (1)..(257) 

<223> primer 

<400> 62 

ggatccgata tcgggcccgc tagcgttaac cctgctttaa tgagatatgc gagacgccta 60 

tgatcgcatg atatttgctt tcaattctgt tgtgcacgtt gtaaaaaacc tgagcatgtg 120 

tagctcagat ccttaccgcc ggtttcggtt cattctaatg aatatatcac ccgttactat 180 

cgtattttta tgaataatat tctccgttca atttactgat tgtccgtcga cgaattcgag 240 

ctcggcgcgc caagctt 257 



<210> 63 

<211> 5410 

<212> DNA 

<213> unknown 

<220> 

<223> primer 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(1) . . (5410) 

Plant expression vector 



<400> 63 



ttttggaaat 
aataatgaag 
ggattttgca 
catagccagc 
gagacgccta 
tgagcatgtg 
ccgttactat 
gcaaatttac 
gtgtgttatg 
tttatgctaa 
tttttgtctt 
tactatagga 
gcaatgctgc 
taccacacaa 
agacaacaat 
ttaaaaatat 
ggaggatgca 
atataatgag 
taatttcttc 



gatttgcatg 
aaaactacaa 
atactttcat 
ggatccgata 
tgatcgcatg 
tagctcagat 
cgtattttta 
acattgccac 
tatttgattt 
cgtttgccaa 
ctaaatacat 
gaattaaagt 
atggatggca 
gatttgaggt 
gttaccacac 
tttggaaatg 
ataatgaaga 
gattttgcaa 
atagccagca 



gaagccatgt 
atttacatgc 
tcatacacac 
tcgggcccgc 
atatttgctt 
ccttaccgcc 
tgaataatat 
taaacgtcta 
gcgataaatt 
cacttagcaa 
atactaatca 
gagtgaatat 
tatacaccaa 
gcatgaacgt 
acaagttttg 
atttgcatgg 
aaactacaaa 
tactttcatt 
gatctgccgg 



gtaaaaccat 
aactagttat 
tcactaagtt 
tagcgttaac 
tcaattctgt 
ggtttcggtt 
tctccgttca 
aacccttgta 
tttatatttg 
tttgcaagtt 
actggaaatg 
ggtaccacaa 
acattcaata 
cacgtggaca 
aggtgcatgc 
aagccatgtg 
tttacatgca 
catacacact 
catcgatccc 



gacatccact 
gcatgtagtc 
ttacacgatt 
cctgctttaa 
tgtgcacgtt 
cattctaatg 
atttactgat 
atttgttttt 
gtactaaatt 
gattaattga 
taaatatttg 
ggtttggaga 
attcttgagg 
aaaggtttag 
atggatgccc 
taaaaccatg 
actagttatg 
cactaagttt 
gggccatggc 



tggaggatgc 
tatataatga 
ataatttctt 
tgagatatgc 
gtaaaaaacc 
aatatatcac 
tgtccgtcga 
gttttactat 
tataacacct 
ttctaaatta 
ctaatatttc 
tttaattgtt 
ataataatgg 
taatttttca 
tgtggaaagt 
acatccactt 
catgtagtct 
tacacgatta 
ctgctttaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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gagatatgcg agacgcctat gatcgcatga tatttgcttt caattctgtt gtgcacgttg 1200 

taaaaaacct gagcatgtgt agctcagatc cttaccgccg gtttcggttc attctaatga 1260 

atatatcacc cgttactatc gtatttttat gaataatatt ctccgttcaa tttactgatt 1320 

gtccgtcgac gagctcggcg cgccaagctt ggcgtaatca tggtcatagc tgtttcctgt 1380 

gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa 1440 

agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct cactgcccgc 1500 

tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag 1560 

aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 1620 

cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 1680 

atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 1740 

taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 1800 

aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 1860 

tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 1920 

gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 1980 
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc . 2040 

cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 2100 

atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 2160 

tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat 2220 

ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 2280 

acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa ' 2340 

aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 2400 

aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 24 60 

tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 2520 

cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 2580 

catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 2640 

ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 2700 

aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 2760 

ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 2820 

caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 2880 

attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 2940 

agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 3000 

actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 3060 

ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 3120 

ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 3180 

gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 3240 

atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 3300 

cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 3360 

gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 3420 

gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 3480 

ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat 3540 

gacattaacc tataaaaata ggcgtatcac gaggcccttt cgtctcgcgc gtttcggtga 3600 

tgacggtgaa aacctctgac acatgcagct cccggagacg gtcacagctt gtctgtaagc 3660 

ggatgccggg agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg 3720 

ctggcttaac tatgcggcat cagagcagat tgtactgaga gtgcaccata tgcggtgtga 3780 

aataccgcac agatgcgtaa ggagaaaata ccgcatcagg cgccattcgc cattcaggct 3840 

gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa 3900 

agggggatgt gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg 3960 

ttgtaaaacg acggccagtg aattcggcgc gccgagctcc tcgagcaaat ttacacattg 4020 

ccactaaacg tctaaaccct tgtaatttgt ttttgtttta ctatgtgtgt tatgtatttg 4080 

atttgcgata aatttttata tttggtacta aatttataac accttttatg ctaacgtttg 4140 

ccaacactta gcaatttgca agttgattaa ttgattctaa attatttttg tcttctaaat 4200 

acatatacta atcaactgga aatgtaaata tttgctaata tttctactat aggagaatta 4260 

aagtgagtga atatggtacc acaaggtttg gagatttaat tgttgcaatg ctgcatggat 4 320 

ggcatataca ccaaacattc aataattctt gaggataata atggtaccac acaagatttg 4 380 

aggtgcatga acgtcacgtg gacaaaaggt ttagtaattt ttcaagacaa caatgttacc 4440 

acacacaagt tttgaggtgc atgcatggat gccctgtgga aagtttaaaa atattttgga 4500 

aatgatttgc atggaagcca tgtgtaaaac catgacatcc acttggagga tgcaataatg 4560 
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aagaaaacta caaatttaca tgcaactagt tatgcatgta gtctatataa tgaggatttt 4 620 

gcaatacttt cattcataca cactcactaa gttttacacg attataattt cttcatagcc 4 680 

agcccaccgc ggtgggcggc cgcctgcagt ctagaaggcc tcctgcttta atgagatatg 4 74 0 

cgagacgcct atgatcgcat gatatttgct ttcaattctg ttgtgcacgt tgtaaaaaac 4800 

ctgagcatgt gtagctcaga tccttaccgc cggtttcggt tcattctaat gaatatatca 4860 

cccgttacta tcgtattttt atgaataata ttctccgttc aatttactga ttgtccgtcg 4 920 

agcaaattta cacattgcca ctaaacgtct aaacccttgt aatttgtttt tgttttacta 4980 

tgtgtgttat gtatttgatt tgcgataaat ttttatattt ggtactaaat ttataacacc 5040 

ttttatgcta acgtttgcca acacttagca atttgcaagt tgattaattg attctaaatt 5100 

atttttgtct tctaaataca tatactaatc aactggaaat gtaaatattt gctaatattt 5160 

ctactatagg agaattaaag tgagtgaata tggtaccaca aggtttggag atttaattgt 5220 

tgcaatgctg catggatggc atatacacca aacattcaat aattcttgag gataataatg 5280 

gtaccacaca agatttgagg tgcatgaacg tcacgtggac aaaaggttta gtaatttttc 5340 

aagacaacaa tgttaccaca cacaagtttt gaggtgcatg catggatgcc ctgtggaaag 54 00 

tttaaaaata 5410 

<210> 64 

<211> 12093 

<212> DNA 
<213> unknown 

<220> 

<223> primer 
<220> 

<221> misc_f eature 
<222> (1) . . (12093) 
<223> Plant expression vector 

<400> 64 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 

gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 

tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 

ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240 

atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 

ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 

tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 

gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 

tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 54 0 

cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 

ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 

ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 

gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 

ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 84 0 

ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900 

ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 

ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 

tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 

ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 

ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 

gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260 

ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 

aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380 

ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 14 4 0 

aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 
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gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 

ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620 

tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680 

ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 

ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 

ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 

agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 

aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 

aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 

tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 

tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 

gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 

aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 

gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 

gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 

cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 24 60 

cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 

cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 

gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640 

gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 

tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 27 60 

gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 

ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 

aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 

tgccccccct tctcgaaccc tcccggcccg ctaac'gcggg cctcccatcc ccccaggggc 3000 

tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 

ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 

ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 

gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 

cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 

tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 

gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420 

ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 

tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 

ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 

tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 

attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 

tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 

gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840 

gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 

cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 

atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 

gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 

tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 

cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 

catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 

ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 

acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 

agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4 44 0 

tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 

aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 

cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620 

taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 

gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 

aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 

gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 

gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4 920 
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gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 

aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 

ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 

gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 

cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 

gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 

ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 

ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 54 60 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 

gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 

cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 

ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 

cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 57 60 

cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 

gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 

gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 

ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 

aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 

cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 

ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 

ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 

ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 

cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360 

cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 

ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480 

cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 654 0 

ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 

gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 

ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 

aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 

cggcgaagcc tgcgaa'gagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840 

tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 

agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 

tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 

ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 

cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140 

cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200 

ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260 

aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320 

ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380 

cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440 

tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500 

acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gctaggtagc 7560 

ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7620 

gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7680 

gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 

acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800 

tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860 

ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 

acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980 

catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040 

aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 

cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 

cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 

tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 

tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 834 0 
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gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 

gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 84 60 

tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 

gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 

tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 

gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 

agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 8760 

gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 

tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 

ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940 

ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 

gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 

gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 

aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 

ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 

tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300 

tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 

agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 

tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 94 80 

tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540 

ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 

tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 

agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720 

gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780 

gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 

gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 

tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960 

agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020 

ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 

ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 

cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 

tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260 

agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 

acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 

tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 

gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 

gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 

cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 

gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 

ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740 

actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 

gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 

attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 

tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 

tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 

aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 

attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160 

ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220 

tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 

taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 

tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat "11400 

aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 114 60 

ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 

agccagccca ccgcggtggg cggccgcctg cagtctagaa ggcctcctgc tttaatgaga 11580 

tatgcgagac gcctatgatc gcatgatatt tgctttcaat tctgttgtgc acgttgtaaa 11640 

aaacctgagc atgtgtagct cagatcctta ccgccggttt cggttcattc taatgaatat 11700 

atcacccgtt actatcgtat ttttatgaat aatattctcc gttcaattta ctgattgtcc 11760 
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gtcgacgaat tcgagctcgg cgcgcctcta gaggatcgat gaattcagat cggctgagtg 11820 

gctccttcaa cgttgcggtt ctgtcagttc caaacgtaaa acggcttgtc ccgcgtcatc 11880 

ggcgggggtc ataacgtgac tcccttaatt ctccgctcat gatcagattg tcgtttcccg 11940 

ccttcagttt aaactatcag tgtttgacag gatatattgg cgggtaaacc taagagaaaa 12000 

gagcgtttat tagaataatc ggatatttaa aagggcgtga aaaggtttat ccttcgtcca 12060 

tttgtatgtg catgccaacc acagggttcc cca 12093 
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<212> DNA 
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gatctggcgc 

gcgcccagca 
tagtgggcgg 
ataatcaggc 
atgttgggtt 
ttctttatca 
tgacaaagtt 
gcgtagacgg 
tttactggca 
cggagaatca 
ggaatgcccg 
ccggcacgcg 
gcgaggcggg 
ctgttggggc 
ccgttgaaca 
ccggtccgga 
ggaggctcgt 
tgccggagcg 
ccaccgcgtc 
ccaagcctca 
gctcactgac 
ggcggtaata 
aggccagcaa 
ccgcccccct 
aggactataa 
gaccctgccg 
ccgctgcata 
tcgcacgata 
ggcgtcagcc 
ctgtccctta 
ctaccgccgg 
agggcagccc 
aggcggcggc 
aaatcacggg 
tgggccgcct 
tcggtgatgc 



cggccagcga 

caggtgcgca 
tgacgtcgtt 
cgatgccgac 
tcacgtctgg 
ctgataagtt 
gcagccgaat 
tctgacgaca 
cttcaggaac 
tacgcattcg 
cagcttcagg 
accgggcgca 
tttttcggcc 
cgtgcttgag 
ggctccgctc 
cgcagcgttc 
tgtcaggaac 
caacccactc 
agacgcccgt 
cggccgcgct 
tcgctgcgct 
cggttatcca 
aaggccagga 
gacgagcatc 
agataccagg 
cttaccggat 
accctgcttc 
tacaggattt 
gggcaggata 
ttcgcacctg 
cgtaacagat 
acctatcaag 
ggccggcatg 
cgtcgtggac 
gggcggcctg 
cacgatcctc 



gacgagcaag 
ggcaaattgc 
cgagtgaacc 
agcgtcgagc 
cctccggacc 
ggtggacata 
acagtgatcc 
cgcaaactgg 
aagcgggcgc 
gtgccgagag 
caggcgctgc 
ccgcagatgg 
ggggacgccg 
gagcaggccg 
tcgccgctgt 
gagcagggac 
gttgaaggac 
actacagcag 
agcagcccgc 
cggcctctct 
cggtcgttcg 
cagaatcagg 
accgtaaaaa 
acaaaaatcg 
cgtttccccc 
acctgtccgc 
ggggtcatta 
tgccaaaggg 
ggtgaagtag 
gcggtgctca 
gagggcaagc 
gtgtactgcc 
agcctgtcgg 
tatgagcacg 
ctgaaactct 
gccctgctgg 



attggccgcc 

accaacgcat 
agatcgcgca 
gcgacagtgc 
agcctccgct 
ttatgtttat 
gtgccgccct 
cggaacggtt 
tgctcgacgc 
ccgacgacga 
tcgcctaccg 
aaacggccga 
tcaatgcgct 
gcgacagcga 
tgcgggccgc 
tcgcggtgat 
cgagaaaggg 
agccatgtag 
tacgggcttt 
ggcggccttc 
gctgcggcga 
ggataacgca 
ggccgcgttg 
acgctcaagt 
tggaagctcc 
ctttctccct 
tagcgatttt 
ttcgtgtaga 
gcccacccgc 
acgggaatcc 
ggatggctga 
ttccagacga 
cctacctgct 
tccgcgagct 
ggctcaccga 
cgaagatcga 



gcccgaaacg 

acagcgccag 
ggaggcccgg 
tcagaattac 
ggtccgattg 
cagtgataaa 
ggacctgttg 
gggggttcag 
actggccgaa 
ctggcgctca 
cgatggcgcg 
cgcgcagctt 
gatgacaatc 
tgccggcgag 
gatagacgcc 
tgtcgatgga 
tgacgattga 
acaacatccc 
ttcatgccct 
tggcgctctt 
gcggtatcag 
ggaaagaaca 
ctggcgtttt 
cagaggtggc 
ctcgtgcgct 
tcgggaagcg 
ttcggtatat 
ctttccttgg 
gagcgggtgt 
tgctctgcga 
tgaaaccaag 
acgaagagcg 
ggccgtcggc 
ggcccgcatc 
cgacccgcgc 
agagaagcag 



atccgacagc 

cagaatgcca 
cagcaccggc 
gatcaggggt 
aacgcgcgga 
gtgtcaagca 
aacgaggtcg 
cagccggcgc 
gccatgctgg 
tttctgatcg 
cgcatccatg 
cgcttcctct 
agctacttca 
cgcggcggca 
ttcgacgaag 
ttggcgaaaa 
tcaggaccgc 
ctcccccttt 
gccctagcgt 
ccgcttcctc 
ctcactcaaa 
tgtgagcaaa 
tccataggct 
gaaacccgac 
ctcctgttcc 
tggcgctttt 
ccatcctttt 
tgtatccaac 
tccttcttca 
ggctggccgg 
ccaaccagga 
attgaggaaa 
cagggctaca 
aatggcgacc 
acggcgcggt 
gacgagcttg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 

aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 

gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 

gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 24 00 

cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 24 60 

cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 

cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 

gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640 

gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 

tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 

gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 

ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 

aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 

tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 

tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 

ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 

ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg . 3180 

gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 

cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 

tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 

gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420 

ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 34 80 

tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 

ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 

tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 

attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 

tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 

gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840 

gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 

cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 

atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 

gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 

tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 

cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 

catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 

ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 

acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 

agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 44 4 0 

tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 

aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 

cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4 620 

taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4 680 

gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 

aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 

gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 

gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4 920 

gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 

aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 

ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 

gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 

cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 

gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 

ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 

ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 54 60 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 

gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 
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cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 

ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 

cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 57 60 

cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 

gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 

gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 

ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 

aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 

cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 

ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 

ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 624 0 

ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 

cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360 

cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 

ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480 

cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 

ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 

gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 

ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 

aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 

cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840 

tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 

agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 

tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 

ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 

cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140 

cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200 

ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260 

aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320 

ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380 

cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440 

tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500 

acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gctaggtagc 7560 

ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7620 

gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7680 

gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 

acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800 

tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 78 60 

ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 

acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980 

catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040 

aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 

cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 

cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 

tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 

tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340 

gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 

gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 84 60 

tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 

gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 

tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 

gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 

agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 8760 

gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 

tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 

ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940 

ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 
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gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 

gattaagttg ggtaacgcc.a gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 

aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 

ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 

tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300 

tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 

agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 

tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 

tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540 

ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 

tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 

agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720 

gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780 

gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 984 0 

gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 

tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960 

agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020 

ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 

ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 

cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 

tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260 

agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 

acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 

tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 

gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 

gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 

cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 

gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 

ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740 

actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 

gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 

attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 

tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 

tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 

aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 

attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160 

ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220 

tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 

taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 

tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 

aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 114 60 

ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 

agccagcgga tccgatatcg ggcccgctag cgttaaccct gctttaatga gatatgcgag 11580 

acgcctatga tcgcatgata tttgctttca attctgttgt gcacgttgta aaaaacctga 11640 

gcatgtgtag ctcagatcct taccgccggt ttcggttcat tctaatgaat atatcacccg 11700 

ttactatcgt atttttatga ataatattct ccgttcaatt tactgattgt ccgtcgacga 117 60 

attcgagctc ggcgcgcctc tagaggatcg atgaattcag atcggctgag tggctccttc 11820 

aacgttgcgg ttctgtcagt tccaaacgta aaacggcttg tcccgcgtca tcggcggggg 11880 

tcataacgtg actcccttaa ttctccgctc atgatcagat tgtcgtttcc cgccttcagt 11940 

ttaaactatc agtgtttgac aggatatatt ggcgggtaaa cctaagagaa aagagcgttt 12000 

attagaataa tcggatattt aaaagggcgt gaaaaggttt atccttcgtc catttgtatg 12060 

tgcatgccaa ccacagggtt cccca 12085 

<210> 66 



<211> 
<212> 



12079 
DNA 
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<213> unknown 
<220> 

<221> misc_f eature 
<222> (1) . . (12079) 
<223> Plant expression vector 

<400> 66 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 

gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 

tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 

ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 24 0 

atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 

ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 

tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 

gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 

tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 

cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 

ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 

ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 

gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 

ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 

ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900 

ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 

ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 

tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 

ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 

ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 

gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 12 60 

ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 

aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380 

ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 

aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 

gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 

ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620 

tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680 

ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 

ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 

ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 

agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 

aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 

aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 

tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 

tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 

gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 

aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 

gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 

gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 

cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 24 60 

cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 

cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 

gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640 

gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 

tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 

gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 

ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 
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aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 

tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 

tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 30 60 

ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 

ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 

gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 

cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 

tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 

gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420 

ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 

tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 

ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 

tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 

attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 

tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 

gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840 

gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 

cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 

atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 

gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 

tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 

cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4 200 

catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 

ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 

acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 

agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 

tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 

aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 

cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4 620 

taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4 680 

gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 

aacctatatt taaaaatgac . ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 

gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 

gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920 

gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4 980 

aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 

ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 

gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 

cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 

gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 

ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 

etggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 

gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 

cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 

ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 

cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 

cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 

gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 

gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 

ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 

aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 

cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 

ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 

ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 624 0 

ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 
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cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360 

cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 

ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 64 80 

cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 

ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 

gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 

ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 

aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 

cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 684 0 

tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 

agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 

tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 

ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 

cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140 

cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200 

ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260 

aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320 

ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380 

cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 74 40 

tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500 

acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gctaggtagc 7560 

ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7 620 

gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7 680 

gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 

acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800 

tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860 

ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 

acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980 

catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040 

aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 

cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 

cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 

tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 

tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340 

gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 

gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 8460 

tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 

gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 

tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 

gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 

agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 8760 

gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 

tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 

ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 894 0 

ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 

gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 

gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 

aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 

ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 924 0 

tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300 

tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 

agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 

tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 

tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540 

ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 

tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 

agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720 
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gcttccatcc 
gtagccggat 
gcaggagcaa 
tcccttcccg 
agccacgata 
ttgacaaaaa 
ccgattgtct 
cctgcgtgca 
tctggattga 
agtggagcat 
acgcgcaata 
tgagtggctc 
gtcatcggcg 
gcgccatcag 
cttaccagag 
gtctagctat 
ttcccttgtc 
actggctttc 
gcgtgaagct 
attgccacta 
tttgatttgc 
tttgccaaca 
aaatacatat 
attaaagtga 
ggatggcata 
tttgaggtgc 
taccacacac 
tggaaatgat 
aatgaagaaa 
ttttgcaata 
agccagcaga 
acgcctatga 
gcatgtgtag 
ttactatcgt 
gctcggcgcg 
gcggttctgt 
cgtgactccc 
tatcagtgtt 
ataatcggat 
ccaaccacag 



gagtacgtgc 
caagcgtatg 
ggtgagatga 
cttcagtgac 
gccgcgctgc 
gaaccgggcg 
gttgtgccca 
atccatcttg 
gagtgaatat 
ttttgacaag 
atggtttctg 
cttcaacgtt 
ggggtcataa 
atccttggcg 
ggcgccccag 
cgccatgtaa 
cagatagccc 
tacgtgttcc 
tgcatgcctg 
aacgtctaaa 
gataaatttt 
cttagcaatt 
actaatcaac 
gtgaatatgg 
tacaccaaac 
atgaacgtca 
aagttttgag 
ttgcatggaa 
actacaaatt 
ctttcattca 
tctgccggca 
tcgcatgata 
ctcagatcct 
atttttatga 
cctctagagg 
cagttccaaa 
ttaattctcc 
tgacaggata 
atttaaaagg 
ggttcccca 



tcgctcgatg 
cagccgccgc 
caggagatcc 
aacgtcgagc 
ctcgtcctgc 
cccctgcgct 
gtcatagccg 
ttcaatccaa 
gagactctaa 
aaatatttgc 
acgtatgtgc 
gcggttctgt 
cgtgactccc 
gcaagaaagc 
ctggcaattc 
gcccactgca 
agtagctgac 
gcttccttta 
caggtcgacg 
cccttgtaat 
tatatttggt 
tgcaagttga 
tggaaatgta 
taccacaagg 
attcaataat 
cgtggacaaa 
gtgcatgcat 
gccatgtgta 
tacatgcaac 
tacacactca 
tcgatcccgg 
tttgctttca 
taccgccggt 
ataatattct 
atcgatgaat 
cgtaaaacgg 
gctcatgatc 
tattggcggg 
gcgtgaaaag 



cgatgtttcg 
attgcatcag 
tgccccggca 
acagctgcgc 
agttcattca 
gacagccgga 
aatagcctct 
gctcccatgg 
ttggataccg 
tagctgatag 
ttagctcatt 
cagttccaaa 
ttaattctcc 
catccagttt 
cggttcgctt 
agctacctgc 
attcatccgg 
gcagcccttg 
gcgcgccgag 
ttgtttttgt 
actaaattta 
ttaattgatt 
aatatttgct 
tttggagatt 
tcttgaggat 
aggtttagta 
ggatgccctg 
aaaccatgac 
tagttatgca 
ctaagtttta 
gccatggcct 
attctgttgt 
ttcggttcat 
ccgttcaatt 
tcagatcggc 
cttgtcccgc 
agattgtcgt 
taaacctaag 
gtttatcctt 



cttggtggtc 
ccatgatgga 
cttcgcccaa 
aaggaacgcc 
gggcaccgga 
acacggcggc 
ccacccaagc 
gccctcgact 
aggggaattt 
tgaccttagg 
aaactccaga 
cgtaaaacgg 
gctcatgatc 
actttgcagg 
gctgtccata 
tttctctttg 
ggtcagcacc 
cgccctgagt 
ctcctcgagc 
tttactatgt 
taacaccttt 
ctaaattatt 
aatatttcta 
taattgttgc 
aataatggta 
atttttcaag 
tggaaagttt 
atccacttgg 
tgtagtctat 
cacgattata 
gctttaatga 
gcacgttgta 
tctaatgaat 
tactgattgt 
tgagtggctc 
gtcatcggcg 
ttcccgcctt 
agaaaagagc 
cgtccatttg 



gaatgggcag 
tactttctcg 
tagcagccag 
cgtcgtggcc 
caggtcggtc 
atcagagcag 
ggccggagaa 
agagtcgaga 
atggaacgtc 
cgacttttga 
aacccgcggc 
cttgtcccgc 
ttgatcccct 
gcttcccaac 
aaaccgccca 
cgcttgcgtt 
gtttctgcgg 
gcttgcggca 
aaatttacac 
gtgttatgta 
tatgctaacg 
tttgtcttct 
ctataggaga 
aatgctgcat 
ccacacaaga 
acaacaatgt 
aaaaatattt 
aggatgcaat 
ataatgagga 
atttcttcat 
gatatgcgag 
aaaaacctga 
atatcacccg 
ccgtcgacga 
cttcaacgtt 
ggggtcataa 
cagtttaaac 
gtttattaga 
tatgtgcatg 



9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12079 
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<211> 13002 
<212> DNA 
<213> unknown 



<220> 

<221> misc_feature 
<222> (1) . . (13002) 

<223> Plant expression vector with two promoter-terminator expression 
cassettes 



<400> 67 



gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 
gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 



60 
120 
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tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 

ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240 
atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 

ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 
tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 
gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 

tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 
cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 
ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 

ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 

gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 

ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 
ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900 
ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 

ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 

tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 

ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 

ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 

gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260 

ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 

aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380 

. ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 14 4 0 

aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 

gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 

ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620 

tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680 

ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 

ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 

ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 18 60 

agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 

aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 

aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 

tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 

tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 

gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 

aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 

gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 

gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 

cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 24 60 

cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 

cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 

gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640 

gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 

tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 

gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 

ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 

aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 

tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 

tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 

ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 

ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 

gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 

cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 

tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 

gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420 

ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 

tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 
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ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 

tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 

attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 

tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 

gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840 

gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 

cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 

atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 

gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 

tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 

cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 

catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 

ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 

acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 

agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 

tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 
aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt • 4560 

cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620 

taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4 680 

gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 

aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 

gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 

gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920 

gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 

aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 

ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 

gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 

cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 

gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 534 0 

ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 

ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 54 60 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 

gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 

cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 

ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 

cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 57 60 

cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 

gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 

gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 594 0 

ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 

aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 

cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 

ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 

ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 624 0 

ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 

cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360 

cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 

ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 64 80 

cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 

ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 

qtttqctggc qaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 

ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 

aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 

cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840 

tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 

agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 
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tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 

ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 

cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140 

cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200 

ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260 

aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320 

ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380 

cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 74 40 

tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500 

acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gctaggtagc 7560 

ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7620 

gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7680 

gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 

acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800 

tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860 

ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 

acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980 

catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040 

aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 

cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 

cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 

tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 

tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340 

gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 

gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 8460 

tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 

gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 

tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 

gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 

agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa' 8760 

gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 

tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 

ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940 

ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 

gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 

gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 

aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 

ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 

tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300 

tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 

agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 94 20 

tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 

tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540 

ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 

tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 

agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720 

gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780 

gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 

gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 

tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960 

agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020 

ttgacaaaaa gaaccqggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 

ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 

cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 

tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260 

agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 

acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 
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tgagtggctc 
gtcatcggcg 
gcgccatcag 
cttaccagag 
gtctagctat 
ttcccttgtc 
actggctttc 
gcgtgaagct 
attgccacta 
tttgatttgc 
tttgccaaca 
aaatacatat 
attaaagtga 
ggatggcata 
tttgaggtgc 
taccacacac 
tggaaatgat 
aatgaagaaa 
ttttgcaata 
agccagccca 
tatgcgagac 
aaacctgagc 
atcacccgtt 
gtcgagcaaa 
actatgtgtg 
caccttttat 
aattattttt 
atttctacta 
ttgttgcaat 
aatggtacca 
tttcaagaca 
aaagtttaaa 
cacttggagg 
agtctatata 
gattataatt 
ttaatgagat 
cgttgtaaaa 
aatgaatata 
tgattgtccg 
ggctgagtgg 
cgcgtcatcg 
cgtttcccgc 
aagagaaaag 
cttcgtccat 



cttcaacgtt 
ggggtcataa 
atccttggcg 
ggcgccccag 
cgccatgtaa 
cagatagccc 
tacgtgttcc 
tgcatgcctg 
aacgtctaaa 
gataaatttt 
cttagcaatt 
actaatcaac 
gtgaatatgg 
tacaccaaac 
atgaacgtca 
aagttttgag 
ttgcatggaa 
actacaaatt 
ctttcattca 
ccgcggtggg 
gcctatgatc 
atgtgtagct 
actatcgtat 
tttacacatt 
ttatgtattt 
gctaacgttt 
gtcttctaaa 
taggagaatt 
gctgcatgga 
cacaagattt 
acaatgttac 
aatattttgg 
atgcaataat 
atgaggattt 
tcttcatagc 
atgcgagacg 
aacctgagca 
tcacccgtta 
tcgacgaatt 
ctccttcaac 
gcgggggtca 
cttcagttta 
agcgtttatt 
ttgtatgtgc 



gcggttctgt 
cgtgactccc 
gcaagaaagc 
ctggcaattc 
gcccactgca 
agtagctgac 
gcttccttta 
caggtcgacg 
cccttgtaat 
tatatttggt 
tgcaagttga 
tggaaatgta 
taccacaagg 
attcaataat 
cgtggacaaa 
gtgcatgcat 
gccatgtgta 
tacatgcaac 
tacacactca 
cggccgcctg 
gcatgatatt 
cagatcctta 
ttttatgaat 
gccactaaac 
gatttgcgat 
gccaacactt 
tacatatact 
aaagtgagtg 
tggcatatac 
gaggtgcatg 
cacacacaag 
aaatgatttg 
gaagaaaact 
tgcaatactt 
cagcggatcc 
cctatgatcg 
tgtgtagctc 
ctatcgtatt 
cgagctcggc 
gttgcggttc 
taacgtgact 
aactatcagt 
agaataatcg 
atgccaacca 



cagttccaaa 
ttaattctcc 
catccagttt 
cggttcgctt 
agctacctgc 
attcatccgg 
gcagcccttg 
gcgcgccgag 
ttgtttttgt 
actaaattta 
ttaattgatt 
aatatttgct 
tttggagatt 
tcttgaggat 
aggtttagta 
ggatgccctg 
aaaccatgac 
tagttatgca 
ctaagtttta 
cagtctagaa 
tgctttcaat 
ccgccggttt 
aatattctcc 
gtctaaaccc 
aaatttttat 
agcaatttgc 
aatcaactgg 
aatatggtac 
accaaacatt 
aacgtcacgt 
ttttgaggtg 
catggaagcc 
acaaatttac 
tcattcatac 
gatatcgggc 
catgatattt 
agatccttac 
tttatgaata 
gcgcctctag 
tgtcagttcc 
cccttaattc 
gtttgacagg 
gatatttaaa 
cagggttccc 



cgtaaaacgg 
gctcatgatc 
actttgcagg 
gctgtccata 
tttctctttg 
ggtcagcacc 
cgccctgagt 
ctcctcgagc 
tttactatgt 
taacaccttt 
ctaaattatt 
aatatttcta 
taattgttgc 
aataatggta 
atttttcaag 
tggaaagttt 
atccacttgg 
tgtagtctat 
cacgattata 
ggcctcctgc 
tctgttgtgc 
cggttcattc 
gttcaattta 
ttgtaatttg 
atttggtact 
aagttgatta 
aaatgtaaat 
cacaaggttt 
caataattct 
ggacaaaagg 
catgcatgga 
atgtgtaaaa 
atgcaactag 
acactcacta 
ccgctagcgt 
gctttcaatt 
cgccggtttc 
atattctccg 
aggatcgatg 
aaacgtaaaa 
tccgctcatg 
atatattggc 
agggcgtgaa 
ca 



cttgtcccgc 
ttgatcccct 
gcttcccaac 
aaaccgccca 
cgcttgcgtt 
gtttctgcgg 
gcttgcggca 
aaatttacac 
gtgttatgta 
tatgctaacg 
tttgtcttct 
ctataggaga 
aatgctgcat 
ccacacaaga 
acaacaatgt 
aaaaatattt 
aggatgcaat 
ataatgagga 
atttcttcat 
tttaatgaga 
acgttgtaaa 
taatgaatat 
ctgattgtcc 
tttttgtttt 
aaatttataa 
attgattcta 
atttgctaat 
ggagatttaa 
tgaggataat 
tttagtaatt 
tgccctgtgg 
ccatgacatc 
ttatgcatgt 
agttttacac 
taaccctgct 
ctgttgtgca 
ggttcattct 
ttcaatttac 
aattcagatc 
cggcttgtcc 
atcagattgt 
gggtaaacct 
aaggtttatc 



10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13002 



<210> 68 

<211> 13905 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<223> Plant expression vector with three promoter-terminator expression 
cassettes 



<400> 68 
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gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 

gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 

tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 

ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240 

atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 

ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 

tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 

gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 

tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 

cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 

ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 

ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 
gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca ' 780 

ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 
ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag • 900 

ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 

ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 

tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 

ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 

ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 

gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 12 60 

ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 

aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380 

ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 

aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 

gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 

ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620 

tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac .1680 

ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 174 0 

ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 

ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 

agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 

aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 

aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 

tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 

tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 

gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 

aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 

gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 

gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 

cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 24 60 

cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 

cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 

gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640 

gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 

tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 

gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 

ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 

aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 

tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 

tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 

ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 

ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 

gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 

cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 

tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 
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gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420 

ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 34 80 

tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 354 0 

ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 

tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 
attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt ■ 3720 

tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 

gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840 

gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 

cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 

atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 

gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 

tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 

cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 

catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 

ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 

acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 

agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 

tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 

aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 

cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620 

taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 

gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 

aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 

gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 

gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920 

gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4 980 

aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 

ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 

gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 

cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 

gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 534 0 

ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 

ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 54 60 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 

gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 

cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 

ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 

cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 57 60 

cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 

gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 

gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 

ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 

aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 

cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 

ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 

ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 

ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 

cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360 

cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 

ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480 

cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 654 0 

ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 

gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 

ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 

aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 
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cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840 

tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 

agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 

tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 

ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 

cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 714 0 

cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200 

ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260 

aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320 

ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380 

cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440 

tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500 

acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gctaggtagc 7560 

ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7 62 0 

gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7680 

gtttccatgg cgttcggaac cgt.gctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 

acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800 

tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 78 60 

ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 

acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980 

catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040 

aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 

cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 

cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 

tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 

tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340 

gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 

gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 84 60 

tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 

gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 

tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 

gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 

agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 87 60 

gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 

tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 

ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940 

ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 

gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 

gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 

aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 

ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 

tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300 

tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 

agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 

tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 94 80 

tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540 

ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 

tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 

agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720 

gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780 

gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 

gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 

tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960 

agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020 

ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 

ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 

cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 
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tctggattga 
agtggagcat 
acgcgcaata 
tgagtggctc 
gtcatcggcg 
gcgccatcag 
cttaccagag 
gtctagctat 
ttcccttgtc 
actggctttc 
gcgtgaagct 
attgccacta 
tttgatttgc 
tttgccaaca 
aaatacatat 
attaaagtga 
ggatggcata 
tttgaggtgc 
taccacacac 
tggaaatgat 
aatgaagaaa 
ttttgcaata 
agccagccca 
tatgcgagac 
aaacctgagc 
atcacccgtt 
gtcgagcaaa 
actatgtgtg 
caccttttat 
aattattttt 
atttctacta 
ttgttgcaat 
aatggtacca 
tttcaagaca 
aaagtttaaa 
cacttggagg 
agtctatata 
gattataatt 
ttaatgagat 
cgttgtaaaa 
aatgaatata 
tgattgtccg 
ttttgtttta 
aatttataac 
ttgattctaa 
tttgctaata 
gagatttaat 
gaggataata 
ttagtaattt 
gccctgtgga 
catgacatcc 
tatgcatgta 
gttttacacg 
tggcctgctt 
tgttgtgcac 
gttcattcta 
tcaatttact 



gagtgaatat 
ttttgacaag 
atggtttctg 
cttcaacgtt 
ggggtcataa 
atccttggcg 
ggcgccccag 
cgccatgtaa 
cagatagccc 
tacgtgttcc 
tgcatgcctg 
aacgtctaaa 
gataaatttt 
cttagcaatt 
actaatcaac 
gtgaatatgg 
tacaccaaac 
atgaacgtca 
aagttttgag 
ttgcatggaa 
actacaaatt 
ctttcattca 
ccgcggtggg 
gcctatgatc 
atgtgtagct 
actatcgtat 
tttacacatt 
ttatgtattt 
gctaacgttt 
gtcttctaaa 
taggagaatt 
gctgcatgga 
cacaagattt 
acaatgttac 
aatattttgg 
atgcaataat 
atgaggattt 
tcttcatagc 
atgcgagacg 
aacctgagca 
tcacccgtta 
tcgagcaaat 
ctatgtgtgt 
accttttatg 
attatttttg 
tttctactat 
tgttgcaatg 
atggtaccac 
ttcaagacaa 
aagtttaaaa 
acttggagga 
gtctatataa 
attataattt 
taatgagata 
gttgtaaaaa 
atgaatatat 
gattgtccgt 



gagactctaa 
aaatatttgc 
acgtatgtgc 
gcggttctgt 
cgtgactccc 
gcaagaaagc 
ctggcaattc 
gcccactgca 
agtagctgac 
gcttccttta 
caggtcgacg 
cccttgtaat 
tatatttggt 
tgcaagttga 
tggaaatgta 
taccacaagg 
attcaataat 
cgtggacaaa 
gtgcatgcat 
gccatgtgta 
tacatgcaac 
tacacactca 
cggccgcctg 
gcatgatatt 
cagatcctta 
ttttatgaat 
gccactaaac 
gatttgcgat 
gccaacactt 
tacatatact 
aaagtgagtg 
tggcatatac 
gaggtgcatg 
cacacacaag 
aaatgatttg 
gaagaaaact 
tgcaatactt 
cagcggatcc 
cctatgatcg 
tgtgtagctc 
ctatcgtatt 
ttacacattg 
tatgtatttg 
ctaacgtttg 
tcttctaaat 
aggagaatta 
ctgcatggat 
acaagatttg 
caatgttacc 
atattttgga 
tgcaataatg 
tgaggatttt 
cttcatagcc 
tgcgagacgc 
acctgagcat 
cacccgttac 
cgacgagctc 



ttggataccg 
tagctgatag 
ttagctcatt 
cagttccaaa 
ttaattctcc 
catccagttt 
cggttcgctt 
agctacctgc 
attcatccgg 
gcagcccttg 
gcgcgccgag 
ttgtttttgt 
actaaattta 
ttaattgatt 
aatatttgct 
tttggagatt 
tcttgaggat 
aggtttagta 
ggatgccctg 
aaaccatgac 
tagttatgca 
ctaagtttta 
cagtctagaa 
tgctttcaat 
ccgccggttt 
aatattctcc 
gtctaaaccc 
aaatttttat 
agcaatttgc 
aatcaactgg 
aatatggtac 
accaaacatt 
aacgtcacgt 
ttttgaggtg 
catggaagcc 
acaaatttac 
tcattcatac 
gatatcgggc 
catgatattt 
agatccttac 
tttatgaata 
ccactaaacg 
atttgcgata 
ccaacactta 
acatatacta 
aagtgagtga 
ggcatataca 
aggtgcatga 
acacacaagt 
aatgatttgc 
aagaaaacta 
gcaatacttt 
agcagatctg 
ctatgatcgc 
gtgtagctca 
tatcgtattt 
ggcgcgcctc 



aggggaattt 
tgaccttagg 
aaactccaga 
cgtaaaacgg 
gctcatgatc 
actttgcagg 
gctgtccata 
tttctctttg 
ggtcagcacc 
cgccctgagt 
ctcctcgagc 
tttactatgt 
taacaccttt 
ctaaattatt 
aatatttcta 
taattgttgc 
aataatggta 
atttttcaag 
tggaaagttt 
atccacttgg 
tgtagtctat 
cacgattata 
ggcctcctgc 
tctgttgtgc 
cggttcattc 
gttcaattta 
ttgtaatttg 
atttggtact 
aagttgatta 
aaatgtaaat 
cacaaggttt 
caataattct 
ggacaaaagg 
catgcatgga 
atgtgtaaaa 
atgcaactag 
acactcacta 
ccgctagcgt 
gctttcaatt 
cgccggtttc 
atattctccg 
tctaaaccct 
aatttttata 
gcaatttgca 
atcaactgga 
atatggtacc 
ccaaacattc 
acgtcacgtg 
tttgaggtgc 
atggaagcca 
caaatttaca 
cattcataca 
ccggcatcga 
atgatatttg 
gatccttacc 
ttatgaataa 
tagaggatcg 



atggaacgtc 

cgacttttga 

aacccgcggc 

cttgtcccgc 

ttgatcccct 

gcttcccaac 

aaaccgccca 

cgcttgcgtt 

gtttctgcgg 

gcttgcggca 

aaatttacac 

gtgttatgta 

tatgctaacg 

tttgtcttct 

ctataggaga 

aatgctgcat 

ccacacaaga 

acaacaatgt 

aaaaatattt 

aggatgcaat 

ataatgagga 

atttcttcat 

tttaatgaga 

acgttgtaaa 

taatgaatat 

ctgattgtcc 

tttttgtttt 

aaatttataa 

attgattcta 

atttgctaat 

ggagatttaa 

tgaggataat 

tttagtaatt 

tgccctgtgg 

ccatgacatc 

ttatgcatgt 

agttttacac 

taaccctgct 

ctgttgtgca ■ 

ggttcattct 

ttcaatttac 

tgtaatttgt 

tttggtacta 

agttgattaa 

aatgtaaata 

acaaggtttg 

aataattctt 

gacaaaaggt 

atgcatggat 

tgtgtaaaac 

tgcaactagt 

cactcactaa 

tcccgggcca 

ctttcaattc 

gccggtttcg 

tattctccgt 

atgaattcag 



10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
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atcggctgag tggctccttc aacgttgcgg ttctgtcagt tccaaacgta aaacggcttg 13680 

tcccgcgtca tcggcggggg tcataacgtg actcccttaa ttctccgctc atgatcagat 13740 

tgtcgtttcc cgccttcagt ttaaactatc agtgtttgac aggatatatt ggcgggtaaa 13800 

cctaagagaa aagagcgttt attagaataa tcggatattt aaaagggcgt gaaaaggttt 138 60 

atccttcgtc catttgtatg tgcatgccaa ccacagggtt cccca 13905 

<210> 69 

<211> 1443 

<212> DNA 

<213> Phaeodactylum tricornutum 

<220> 

<221> CDS 

<222> (9).. (1442) 

<223> Delta- G-desaturase 

<400> 69 



gatctaaa atg ggc aaa gga ggg gac get egg gee teg aag ggc tea aeg 50 
Met Gly Lys Gly Gly Asp Ala Arg Ala Ser Lys Gly Ser Thr 
15 10 



geg 


get 


ege 








^gg 




dd 




dd<^ 






gcg 






98 


Ala 


Ala 


Airg 


Lys 


He 


Ser 


Trn 


Gin 


Glu 


Val 


Lys 


Thr 


His 


Ala 


Ser 


Pro 




15 










20 










25 










30 






gac 


gee 




ate 


att 


cac 


t cc 


aat 


aag 


gt c 


tac 


gac 


at cr 


tec 


aac 


146 


O JL U 




Ala 


T m 


Tie 

-L -L ~ 


Tie 


His 


C -i- 


Asn 


Lys 


Val 




Asp 


Val 


Ser 


Asn 












35 










40 










45 






t aa 


cae 


gaa 


cat 


ecc 


Qoa 

y y a 


aac 


gec 


gte 


att 


ttc 


aeg 


cac 


ace 


aat 


aac 


194 


Trp 


His 


Glu 


His 


Pro 


Gly 


Gly 


Ala 


Val 


He 


Phe 


Thr 


His 


Ala 


Gly 


Asp 










50 










55 










60 








gae 


atg 


aeg 


gac 


att 


ttc 


get 


gee 


ttt 


cac 


gea 


ecc 


gga 


teg 


cag 


teg 


242 


Asp 


Met 


Thr 


Asp 


He 


Phe 


Ala 


Ala 


Phe 


His 


Ala 


Pro 


Gly 


Ser 


Gin 


Ser 








65 










70 










75 










etc 


atg 


aag 


aag 


ttc 


tac 


att 


ggc 


gaa 


ttg 


etc 


ccg 


gaa 


acc 


acc 


ggc 


290 


Leu 


Met 


Lys 


Lys 


Phe 


Tyr 


He 


Gly 


Glu 


Leu 


Leu 


Pro 


Glu 


Thr 


Thr 


Gly 






80 










85 










90 












aag 


gag 


ccg 


cag 


caa 


ate 


gcc 


ttt 


gaa 


aag 


ggc 


tac 


cgc 


gat 


ctg 


cgc 


338 


Lys 


Glu 


Pro 


Gin 


Gin 


He 


Ala 


Phe 


Glu 


Lys 


Gly 


Tyr 


Arg 


Asp 


Leu 


Arg 




95 










100 










105 










110 




tec 


aaa 


etc 


ate 


atg 


atg 


ggc 


atg 


ttc 


aag 


tee 


aac 


aag 


tgg 


ttc 


tac 


386 


Ser 


Lys 


Leu 


He 


Met 


Met 


Gly 


Met 


Phe 


Lys 


Ser 


Asn 


Lys 


Trp 


Phe 


Tyr 












115 










120 










125 






gte 


tae 


aag 


tge 


etc 


age 


aac 


atg 


gee 


att 


tgg 


gee 


gee 


gee 


tgt 


get 


434 


Val 


Tyr 


Lys 


Cys 


Leu 


Ser 


Asn 


Met 


Ala 


He 


Trp 


Ala 


Ala 


Ala 


Cys 


Ala 










130 










135 










140 








etc 


gte 


ttf 


tae 


teg 


gae 


ege 


ttc 


tgg 


gta 


cac 


ctg 


gee 


age 


gee 


gte 


482 


Leu 


Val 


Phe 


Tyr 


Ser 


Asp 


Arg 


Phe 


Trp 


Val 


His 


Leu 


Ala 


Ser 


Ala 


Val 








145 










150 










155 










atg 


ctg 


gga 


aca 


ttc 


ttt 


cag 


cag 


teg 


gga 


tgg 


ttg 


gca 


cac 


gac 


ttt 


530 


Met 


Leu 


Gly 


Thr 


Phe 


Phe 


Gin 


Gin 


Ser 


Gly 


Trp 


Leu 


Ala 


His 


Asp 


Phe 






160 










165 










170 












ctg 


cae 


cae 


cag 


gte 


ttc 


acc 


aag 


ege 


aag 


cac 


ggg 


gat 


etc 


gga 


gga 


578 


Leu 


His 


His 


Gin 


Val 


Phe 


Thr 


Lys 


Arg 


Lys 


His 


Gly 


Asp 


Leu 


Gly 


Gly 




175 










180 










185 










190 




etc 


ttt 


tgg 


ggg 


aac 


etc 


atg 


cag 


ggt 


tae 


tec 


gta 


cag 


tgg 


tgg 


aaa 


626 


Leu 


Phe 


Trp 


Gly 


Asn 


Leu 


Met 


Gin 


Gly 


Tyr 


Ser 


Val 


Gin 


Trp 


Trp 


Lys 
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195 



d3C 


aag 


cac 


aa c 


gga 


cac 


Asn 


Lys 


n xo 


7\ o r-\ 

MS n 


oxy 


nxs 








210 








g t c 


gcg 


caa 


g d L 




r\±ai 


V ax 


riXa 


vaxn 


Asp 


Vjxy 






99 R 








gcc 


tgg 


tec 


gtc 


cag 


caa 




Trp 


C A V 


vax 


Pin 

oxn 


Pin 

Vjxn 




9 4 0 










gga 


aag 


ga u 


teg 


ggt 


zzg 


o J. y 


Lys 








Leu 


9 R ^ 










9 fin 


4- -H 
C L L 


rac 


4- 4< 4- 

L L L 


ccc 


=j -h ^ 

arc 


■f- 4- 

ttg 


r ne 


Tyr 


rfie 


Pro 


lie 


Leu 










97 '=i 




tec 


4- 4* /-^ 

LLC 


aag 


t ge 


gcc 


-h 4- 4- 
L L L 


Ser 


rne 


Lys 


Cys 


Aia 


rne 








9 QH 






etc 


gaa 


etc 


aag 


gcc 


aag 


Leu 


Glu 


Leu 


Lys 


Ala 


Lys 






OUD 








ggc 


ate 


ctg 


ctg 


cac 
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tggggaaccc tgtggttggc atgcacatac aaatggacga aggataaacc ttttcacgcc 
cttttaaata tccgattatt ctaataaacg ctcttttctc ttaggtttac ccgccaatat 
atcctgtcaa acactgatag tttaaactga aggcgggaaa cgacaatctg atcatgagcg 
gagaattaag ggagtcacgt tatgaccccc gccgatgacg cgggacaagc cgttttacgt 
ttggaactga cagaaccgca acgttgaagg agccactcag ccgatctgaa ttcatcgatc 
ctctagaggc gcgccgagct cctcgagcaa atttacacat tgccactaaa cgtctaaacc 
cttgtaattt gtttttgttt tactatgtgt gttatgtatt tgatttgcga taaattttta 
tatttggtac taaatttata acacctttta tgctaacgtt tgccaacact tagcaatttg 
caagttgatt aattgattct aaattatttt tgtcttctaa atacatatac taatcaactg 
gaaatgtaaa tatttgctaa tatttctact ataggagaat taaagtgagt gaatatggta 
ccacaaggtt tggagattta attgttgcaa tgctgcatgg atggcatata caccaaacat 
tcaataattc ttgaggataa taatggtacc acacaagatt tgaggtgcat gaacgtcacg 
tggacaaaag gtttagtaat ttttcaagac aacaatgtta ccacacacaa gttttgaggt 
gcatgcatgg atgccctgtg gaaagtttaa aaatattttg gaaatgattt gcatggaagc 
catgtgtaaa accatgacat ccacttggag gatgcaataa tgaagaaaac tacaaattta 
catgcaacta gttatgcatg tagtctatat aatgaggatt ttgcaatact ttcattcata 
cacactcact aagttttaca cgattataat ttcttcatag ccagcccacc gcggtgggcg 
gccgc atg gag gtc gtg gag aga ttc tac ggt gag ttg gat ggg aag gtc 
Met Glu Val Val Glu Arg Phe Tyr Gly Glu Leu Asp Gly Lys Val 
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teg cag ggc gtg aat gca ttg ctg ggt agt ttt ggg gtg gag ttg acg 
Ser Gin Gly Val Asn Ala Leu Leu Gly Ser Phe Gly Val Glu Leu Thr 

20 25 30 

gat acg ccc act acc aaa ggc ttg ccc etc gtt gac agt ccc aca ccc 
Asp Thr Pro Thr Thr Lys Gly Leu Pro Leu Val Asp Ser Pro Thr Pro 
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tatgatcgea tgatatttgc tttcaattct gttgtgcacg ttgtaaaaaa cetgagcatg 1998 

tgtagctcag atecttaccg ecggtttcgg tteattctaa tgaatatate accegttact 2058 

atcgtatttt tatgaataat attctccgtt caatttactg attgtccgte gageaaattt 2118 

acacattgee actaaaegtc taaacccttg taatttgttt ttgttttaet atgtgtgtta 2178 

tgtatttgat ttgegataaa tttttatatt tggtaetaaa tttataacac ettttatget 2238 
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tcatagccag cggatccgcc cacata atg gag aac ttc tgg tct att gtt gtg 2831 

Met Glu Asn Phe Trp Ser He Val Val 
295 



ttt 


ttt 


eta 


etc 


tea 


att 


etc 


ttc 


att 


tta 


tat 


aac 


ata 


teg 


aea 


gta 


2879 


Phe 


Phe 


Leu 


Leu 


Ser 


He 


Leu 


Phe 


He 


Leu 


Tvr 

X yi. 


Asn 


He 


Ser 


Thr 


Val 




300 










305 










310 










315 




t gc 


cac 


tac 


tat 


atg 


CQQ 

^y y 


att 


teg 


ttt 


tat 


tac 


ttc 


aca 


att 


tta 


ttg 


2927 


Cys 


His 


Tvr 


Tvr 


Met 


Arg 


He 


Ser 


Phe 


Tvr 

1 y a. 


Tvr 

X y X. 


Phe 


Thr 


He 


Leu 


Leu 












320 










325 










330 






cat 




atg 


gaa 




tat 
L-y L 


att 
y u u 


aca 


atg 


ate 


ect 


tct 


t aa 

Ly y 


eta 


aat 


aaa 
yy y 


2975 


nj. o 






O J. 


V a. X 


v^y o 


Val 


Thr 


Mpt 


He 


Pro 


Ser 


Trn 


Leu 


Asn 


Gly 










*J <J '•J 










340 










345 








aag 


ncif' 


get 


gat 


tac 


at a 
y ^y 


ttt 


cac 


teg 


ttt 


ttc 


tat 


taa 

uy y 


tat 

uy I- 


aaa 


taa 


3023 


T .\/^ 
u_y o 


v-jx _y 


Ala 


Asp 


TVT 


Val 


Phe 


His 


Ser 


Phe 


Phe 


Tvr 
X yj. 




Cys 


Lys 


Tro 








O J 










J ^ 










360 










act 


yy 


gtt 


cat 


aea 


aea 


gt c 


tat 


y y d 


tat 


gaa 


aaa 


aea 


eaa 


att 

y I- u 


gaa 


3071 


Thr 


Gly 


Val 


His 


Thr 


Thr 


Val 


Tvr 
X y j- 


Gly 


Tvr 

X y J. 


Glu 


Lys 


Thr 


Gin 


Val 


Glu 






365 










370 










375 












y y u 


ccg 


get 


gta 


att 

y u L- 


att 


tat 

L-y L. 


aa t 


cat 


eag 


agt 


tet 


etc 


gae 


att 


eta 


3119 






Ala 


Val 


Val 


He 


Cys 


Asn 


His 


Gin 


Ser 


Ser 


Leu 


Asp 


He 


Leu 




380 










385 










390 










395 




teg 


atg 


gea 


tea 


ate 


t na 

i_y y 


eeg 


aag 


aat 


tat 

uy u 


att 

y I- u 


gta 


atg 


atg 


aaa 


cga 


3167 


^f=> T 
O C i- 




Ala 


Ser 


He 




Pro 


Lys 


Asn 


Cys 


Val 


Val 


Met 


Met 


Lys 


Arg 












400 










405 










410 






att 


ctt 


gee 


tat 


att 


cea 


ttc 


ttc 


aat 


etc 


aaa 

y y » 


gee 


tac 


ttt 


tec 


aac 


3215 


He 


Leu 


Ala 


Tvr 


Val 


Pro 


Phe 


Phe 


Asn 


Leu 


Glv 


Ala 


Tvr 

J. y J. 


Phe 


Ser 


Asn 










415 










420 










425 








aca 


ate 


ttc 


ate 


gat 


cga 


tat 


aac 


egt 


gaa 


egt 


aca 


atg 


act 


tea 


gtt 


3263 


Thr 


He 


Phe 


He 


Asp 


Arg 


Tvr 


Asn 


Arg 


Glu 


Arg 


Ala 


Met 


Ala 


Ser 


Val 








430 










435 










440 










aat 


tat 


tqt 


aca 


tct 


aaa 


ata 


aaa 


aac 


aga 


aat 


ctt 


aaa 


ctt 


tgg 


gta 


3311 


Asp 


Tvr 


Cys 


Ala 


Ser 


Glu 


Met 


Lys 


Asn 


Ara 


Asn 


Leu 


Lvs 


Leu 


Trp 


Val 






445 










450 










455 












ttt 


ccg 
^ ^ ^ 


qaa 


QQa 


aea 


aga 


aat 


egt 


gaa 


gga 


ggg 


ttc 


att 


cea 


ttc 


aag 


3359 


Phe 


Pro 


Glu 


Glv 


Thr 


Arg 


Asn 


Arg 


Glu 


Glv 


Glv 


Phe 


He 


Pro 


Phe 


Lvs 




460 










465 










470 










475 




aaa 


gga 


gea 


ttc 


aat 


att 


gea 


gtt 


egt 


gcg 


eag 


att 


cec 


att 


att 


cea 


3407 


Lys 


Gly 


Ala 


Phe 


Asn 


He 


Ala 


Val 


Arg 


Ala 


Gin 


He 


Pro 


He 


He 


Pro 












480 










485 










490 






gtt 


gta 


ttc 


tea 


gac 


tat 


egg 


gat 


ttc 


tac 


tea 


aag 


cea 


ggc 


cga 


tat 


3455 


Val 


Val 


Phe 


Ser 


Asp 


Tyr 


Arg 


Asp 


Phe 


Tyr 


Ser 


Lys 


Pro 


Gly 


Arg 


Tyr 










495 










500 










505 








ttc 


aag 


aat 


gat 


gga 


gaa 


gtt 


gtt 


att 


ega 


gtt 


etg 


gat 


gcg 


att 


cea 


3503 


Phe 


Lys 


Asn 


Asp 


Gly 


Glu 


Val 


Val 


He 


Arg 


Val 


Leu 


Asp 


Ala 


He 


Pro 








510 










515 










520 










aca 


aaa 


ggg 


etc 


act 


ctt 


gat 


gac 


gte 


age 


gag 


ttg 


tet 


gat 


atg 


tgt 


3551 


Thr 


Lys 


Gly 


Leu 


Thr 


Leu 


Asp 


Asp 


Val 


Ser 


Glu 


Leu 


Ser 


Asp 


Met 


Cys 






525 










530 










535 












egg 


gac 


gtt 


atg 


ttg 


gea 


gee 


tat 


aag 


gaa 


gtt 


act 


eta 


gaa 


get 


eag 


3599 


Arg 


Asp 


Val 


Met 


Leu 


Ala 


Ala 


Tyr 


Lys 


Glu 


Val 


Thr 


Leu 


Glu 


Ala 


Gin 




540 










545 










550 










555 
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caa cga aat gcg aca egg cgt gga gaa aca aaa gac ggg aag aaa tct 3647 
Gin Arg Asn Ala Thr Arg Arg Gly Glu Thr Lys Asp Gly Lys Lys Ser 

560 565 570 

gag taa gctagcgtta accctgcttt aatgagatat gcgagacgcc tatgatcgca 3703 
Glu 



tgatatttgc tttcaattct gttgtgcacg ttgtaaaaaa cctgagcatg tgtagctcag 

atccttaccg ccggtttcgg ttcattctaa tgaatatatc acccgttact atcgtatttt 

tatgaataat attctccgtt caatttactg attgtccgtc gagcaaattt acacattgcc 

actaaacgtc taaacccttg taatttgttt ttgttttact atgtgtgtta tgtatttgat 

ttgcgataaa tttttatatt tggtactaaa tttataacac cttttatgct aacgtttgcc 

aacacttagc aatttgcaag ttgattaatt gattctaaat tatttttgtc ttctaaatac 

atatactaat caactggaaa tgtaaatatt tgctaatatt tctactatag gagaattaaa 

gtgagtgaat atggtaccac aaggtttgga gatttaattg ttgcaatgct gcatggatgg 

catatacacc aaacattcaa taattcttga ggataataat ggtaccacac aagatttgag 

gtgcatgaac gtcacgtgga caaaaggttt agtaattttt caagacaaca atgttaccac 

acacaagttt tgaggtgcat gcatggatgc cctgtggaaa gtttaaaaat attttggaaa 

tgatttgcat ggaagccatg tgtaaaacca tgacatccac ttggaggatg caataatgaa 

gaaaactaca aatttacatg caactagtta tgcatgtagt ctatataatg aggattttgc 

aatactttca ttcatacaca ctcactaagt tttacacgat tataatttct tcatagccag 

cagatctaaa atg ggc aaa gga ggg gac get egg gee teg aag gge tea 
Met Gly Lys Gly Gly Asp Ala Arg Ala Ser Lys Gly Ser 
575 580 585 

cgc aag ate agt tgg eag gaa gte aag acc cac gcg tct 

Arg Lys lie Ser Trp Gin Glu Val Lys Thr His Ala Ser 

590 595 600 

gcc tgg ate att cac tec aat aag gte tac gac gtg tec 

Ala Trp lie lie His Ser Asn Lys Val Tyr Asp Val Ser 
605 610 615 

gaa cat ccc gga ggc gcc gte att ttc acg cac gcc ggt 

Glu His Pro Gly Gly Ala Val He Phe Thr His Ala Gly 

625 630 

acg gac att ttc get gee ttt eae gea ccc gga teg eag 

Thr Asp He Phe Ala Ala Phe His Ala Pro Gly Ser Gin 



acg gcg get 
Thr Ala Ala 

ccg gag gac 
Pro Glu Asp 

aac tgg cac 
Asn Trp His 
620 

gac gac atg 
Asp Asp Met 
635 

teg etc atg 
Ser Leu Met 
650 

gge aag gag 
Gly Lys Glu 

cgc tec aaa 
Arg Ser Lys 

tac gte tac 
Tyr Val Tyr 
700 

get etc gte 
Ala Leu Val 

715 
gte atg etg 
Val Met Leu 
730 

ttt etg cac 
Phe Leu His 

gga etc ttt 



640 



645 



aag aag ttc tac att ggc gaa ttg etc ccg gaa ace ace 

Lys Lys Phe Tyr He Gly Glu Leu Leu Pro Glu Thr Thr 

655 660 665 

ccg eag caa ate gee ttt gaa aag gge tac cgc gat etg 

Pro Gin Gin He Ala Phe Glu Lys Gly Tyr Arg Asp Leu 

670 675 680 

etc ate atg atg ggc atg ttc aag tec aac aag tgg ttc 

Leu He Met Met Gly Met Phe Lys Ser Asn Lys Trp Phe 

685 690 695 

aag tgc etc age aac atg gcc att tgg gee gee gcc tgt 

Lys Cys Leu Ser Asn Met Ala He Trp Ala Ala Ala Cys 

705 710 

ttt tac teg gac cgc ttc tgg gta eae etg gee age gcc 

Phe Tyr Ser Asp Arg Phe Trp Val His Leu Ala Ser Ala 

720 725 

gga aca ttc ttt eag eag teg gga tgg ttg gca cac gac 

Gly Thr Phe Phe Gin Gin Ser Gly Trp Leu Ala His Asp 

735 740 745 

cac eag gte ttc acc aag cgc aag cac ggg gat etc gga 

His Gin Val Phe Thr Lys Arg Lys His Gly Asp Leu Gly 

750 755 760 

tgg ggg aac etc atg cag ggt tac tec gta cag tgg tgg 



3763 
3823 
3883 
3943 
4003 
4063 
4123 
4183 
4243 
4303 
4363 
4423 
4483 
4543 
4592 



4640 



4688 



4736 



4784 



4832 



4880 



4928 



4976 



5024 



5072 



5120 



5168 
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Gly 


Leu 


Phe 


Trp 


Glv 

sj J. jr 


Asn 


Leu 


Met 


Gin 


Gly 


Tyr 


Ser 


Val 


Gin 


Trp 


Trp 










765 










770 










775 








aaa 


aac 


aaq 


cac 


aac 


aaa 


cac 


cac 


gee 


gtc 


ccc 


aac 


etc 


cac 


tge 


tec 


5216 


Lvs 


Asn 


Lvs 


His 


Asn 


Glv 

VJ J. 


His 


His 


Ala 


Val 


Pro 


Asn 


Leu 


His 


Cys 


Ser 








780 










785 










790 










tec 


gea 


gtc 


gcg 


caa 


qat 


qqq 

3 3 3 


gae 


ecg 


gae 


ate 


gat 


ace 


atg 


ccc 


ett 


5264 


Ser 


Ala 


Val 


Ala 


Gin 


Asp 


Gly 


Asp 


Pro 


Asp 


He 


Asp 

It 


Thr 


Met 


Pro 


Leu 






795 










800 










805 












etc 


gee 


tqq 


tee 


qtc 


caq 


caa 


gee 


cag 


tct 


tac 


egg 


gaa 


etc 


caa 


gee 


5312 


Leu 


Ala 


Tro 


Ser 


Val 


Gin 


Gin 


Ala 


Gin 


Ser 


Tvr 


Ara 


Glu 


Leu 


Gin 


Ala 




810 , 










815 










820 










825 




qac 


qqa 


aaq 


qat 


teq 


qqt 


ttq 


gtc 


aag 


ttc 


atg 


ate 


egt 


aae 


caa 


tec 


5360 


Asp 


Glv 


Lys 


Asp 


Ser 


Glv 

vj J- _y 


Leu 


Val 


Lys 


Phe 


Met 


He 


Arq 


Asn 


Gin 


Ser 












830 










835 










840 






tac 


ttt 


tac 


ttt 


ccc 


ate 


ttq 


ttq 

^ ^ 3 


etc 


qcc 


cqc 

^3 


ctq 

^ 3 


teq 


tqq 

*" 3 3 


ttq 


aac 


5408 


Tvr 


Phe 


Tvr 


Phe 


Pro 


lie 


Leu 


Leu 


Leu 


Ala 


Arq 


Leu 


Ser 


Trp 


Leu 


Asn 










845 










850 










855 








aaa 


tec 


ttc 


aag 


tac 


ace 


ttt 


qqq 


ett 


qqa 

3 3^ 


qct 


qeq 

3^3 


teq 

^ 3 


qaq 

3^3 


aac 


qct 

3 ^ ^ 


5456 


Glu 


Ser 


Phe 


Lys 


Cys 


Ala 


Phe 


Glv 


Leu 


Glv 


Ala 


Ala 


Ser 


Glu 


Asn 


Ala 








860 










865 










870 










get 


etc 


gaa 


etc 


aag 


gee 


aag 


aat 


ett 


caa 


tac 


ccc 


ett 


ttq 


qaa 

3 


aaq 


5504 


Ala 


Leu 


Glu 


Leu 


Lys 


Ala 


Lys 


Glv 


Leu 


Gin 


Tvr 


Pro 


Leu 


Leu 


Glu 


Lvs 






875 










880 










885 












act 


aac 


ate 


eta 


etg 


cac 


tac 


act 


taa 

^33 


atq 


ett 


aca 


qtt 


teq 


tec 


qqe 

3 3^ 


5552 


Ala 


Glv 


lie 


Leu 


Leu 


His 


Tvr 


Ala 


Tro 


Met 


Leu 


Thr 


Val 


Ser 


Ser 


Glv 

vj J. _y 




890 










895 










900 










905 




ttt 


aaa 


cac 


ttc 


tea 


ttc 


aca 


tac 


acc 


aca 


ttt- 


tac 


ttt 


eta 


acc 


qeq 


5600 


Phe 


Glv 


Arg 


Phe 


Ser 


Phe 


Ala 


Tvr 


Thr 


Ala 


Phe 


Tvr 


Phe 


Leu 


Thr 


Ala 












910 










915 










920 






acc 


aca 


tec 


tat 


aaa 
y y ° 


ttc 


tta 


etc 


gee 


att 


gtc 


ttt 


aac 


etc 


aac 


cac 


5648 


Thr 


Ala 


Ser 


Cys 


Glv 


Phe 


Leu 


Leu 


Ala 


lie 


Val 


Phe 


Glv 


Leu 


Glv 


His 










925 










930 










935 








aac 


aac 


atg 


gee 


acc 


t ae 


aat 


gee 


gae 


gee 


egt 


eeg 


gae 


ttc 


tqq 

*^ 3 3 


aaq 

3 


5696 


Asn 


Glv 

w J. jr 


Met 


Ala 


Thr 


Tvr 


Asn 


Ala 


ASD 


Ala 


Arq 


Pro 


Asp 


Phe 


Trp 


Lvs 








940 










945 










950 










etc 


caa 


gtc 


ace 


aeg 


act 


CQC 

^ 3 ^ 


aae 


qte 

3 ^ ^ 


acq 

^ ^ 3 


qqe 


qqa 

3 3 


cac 


qqt 

3 3^ 


ttc 


ccc 


5744 


Leu 


Gin 


Val 


Thr 


Thr 


Thr 


Arg 


Asn 


Val 


Thr 


Gly 


Gly 


His 


Gly 


Phe 


Pro 






955 










960 










965 












caa 


gee 


ttt 


gtc 


gae 


tgg 


ttc 


tgt 


ggt 


gge 


etc 


eag 


tac 


caa 


gtc 


gae 


5792 


Gin 


Ala 


Phe 


Val 


Asp 


Trp 


Phe 


Cys 


Gly 


Gly 


Leu 


Gin 


Tyr 


Gin 


Val 


Asp 




970 










975 










980 










985 




cac 


cac 


tta 


ttc 


ccc 


age 


etg 


ccc 


ega 


cac 


aat 


etg 


gee 


aag 


aca 


cac 


5840 


His 


His 


Leu 


Phe 


Pro 


Ser 


Leu 


Pro 


Arg 


His 


Asn 


Leu 


Ala 


Lys 


Thr 


His 





990 995 1000 

gea etg gtc gaa teg ttc tge aag gag tgg ggt gtc eag tac cac 5885 
Ala Leu Val Glu Ser Phe Cys Lys Glu Trp Gly Val Gin Tyr His 

1005 1010 1015 

gaa gee gae ett gtg gae ggg acc atg gaa gtc ttg cac eat ttg 5930 
Glu Ala Asp Leu Val Asp Gly Thr Met Glu Val Leu His His Leu 

1020 1025 1030 

gge age gtg gee gge gaa ttc gtc gtg gat ttt gta egc gat gga 5975 
Gly Ser Val Ala Gly Glu Phe Val Val Asp Phe Val Arg Asp Gly 

1035 1040 1045 

ccc gee atg taa agatctgccg gcatcgatcc egggceatgg cetgetttaa 6027 
Pro Ala Met 

tgagatatgc gagacgecta tgatcgcatg atatttgett teaattctgt tgtgcacgtt 6087 
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gtaaaaaacc tgagcatgtg tagctcagat ccttaccgcc ggtttcggtt cattctaatg 6147 

aatatatcac ccgttactat cgtattttta tgaataatat tctccgttca atttactgat 6207 

tgtccgtcga cgagctcggc gcgccgtcga cctgcaggca tgcaagcttc acgctgccgc 62 67 

aagcactcag ggcgcaaggg ctgctaaagg aagcggaaca cgtagaaagc cagtccgcag 6327 

aaacggtgct gaccccggat gaatgtcagc tactgggcta tctggacaag ggaaaacgca 6387 

agcgcaaaga gaaagcaggt agcttgcagt gggcttacat ggcgatagct agactgggcg 64 4 7 

gttttatgga cagcaagcga accggaattg ccagctgggg cgccctctgg taaggttggg 6507 

aagccctgca aagtaaactg gatggctttc ttgccgccaa ggatctgatg gcgcagggga 6567 

tcaagatcat gagcggagaa ttaagggagt cacgttatga cccccgccga tgacgcggga 6627 

caagccgttt tacgtttgga actgacagaa ccgcaacgtt gaaggagcca ctcagccgcg 6687 

ggtttctgga gtttaatgag ctaagcacat acgtcagaaa ccattattgc gcgttcaaaa 6747 

gtcgcctaag gtcactatca gctagcaaat atttcttgtc aaaaatgctc cactgacgtt 6807 

ccataaattc ccctcggtat ccaattagag tctcatattc actctcaatc cagatctcga 6867 

ctctagtcga gggcccatgg gagcttggat tgaacaagat ggattgcacg caggttctcc 6927 

ggccgcttgg gtggagaggc tattcggcta tgactgggca caacagacaa tcggctgctc 6987 

tgatgccgcc gtgttccggc tgtcagcgca ggggcgcccg gttctttttg tcaagaccga 7047 

cctgtccggt gccctgaatg aactgcagga cgaggcagcg cggctatcgt ggctggccac 7107 

gacgggcgtt ccttgcgcag ctgtgctcga cgttgtcact gaagcgggaa gggactggct 7167 

gctattgggc gaagtgccgg ggcaggatct cctgtcatct caccttgctc ctgccgagaa 7227 

agtatccatc atggctgatg caatgcggcg gctgcatacg cttgatccgg ctacctgccc 7287 

attcgaccac caagcgaaac atcgcatcga gcgagcacgt actcggatgg aagccggtct 7347 

tgtcgatcag gatgatctgg acgaagagca tcaggggctc gcgccagccg aactgttcgc 7407 

caggctcaag gcgcgcatgc ccgacggcga ggatctcgtc gtgacccatg gcgatgcctg 7467 

cttgccgaat atcatggtgg aaaatggccg cttttctgga ttcatcgact gtggccggct 7527 

gggtgtggcg gaccgctatc aggacatagc gttggctacc cgtgatattg ctgaagagct 7587 

tggcggcgaa tgggctgacc gcttcctcgt gctttacggt atcgccgctc ccgattcgca 7647 

gcgcatcgcc ttctatcgcc ttcttgacga gttcttctga gcgggaccca agctagcttc 7707 

gacggatccc ccgatgagct aagctagcta tatcatcaat ttatgtatta cacataatat 7767 

cgcactcagt ctttcatcta cggcaatgta ccagctgata taatcagtta ttgaaatatt 7827 

tctgaattta aacttgcatc aataaattta tgtttttgct tggactataa tacctgactt 7887 

gttattttat caataaatat ttaaactata tttctttcaa gatgggaatt aattcactgg 7947 

ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt aatcgccttg 8007 

cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc gatcgccctt 8067 

cccaacagtt gcgcagcctg aatggcgccc gctcctttcg ctttcttccc ttcctttctc 8127 

gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga 8187 

tttagtgctt tacggcacct cgaccccaaa aaacttgatt tgggtgatgg ttcacgtagt 8247 

gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat 8307 

agtggactct tgttccaaac tggaacaaca ctcaacccta tctcgggcta ttcttttgat 8367 

ttataaggga ttttgccgat ttcggaacca ccatcaaaca ggattttcgc ctgctggggc 8427 

aaaccagcgt ggaccgcttg ctgcaactct ctcagggcca ggcggtgaag ggcaatcagc 8487 

tgttgcccgt ctcactggtg aaaagaaaaa ccaccccagt acattaaaaa cgtccgcaat 8547 

gtgttattaa gttgtctaag cgtcaatttg tttacaccac aatatatcct gccaccagcc 8607 

agccaacagc tccccgaccg gcagctcggc acaaaatcac cactcgatac aggcagccca 8 667 

tcagtccggg acggcgtcag cgggagagcc gttgtaaggc ggcagacttt gctcatgtta 8727 

ccgatgctat tcggaagaac ggcaactaag ctgccgggtt tgaaacacgg atgatctcgc 8787 

ggagggtagc atgttgattg taacgatgac agagcgttgc tgcctgtgat caaatatcat 8847 

ctccctcgca gagatccgaa ttatcagcct tcttattcat ttctcgctta accgtgacag 8907 

gctgtcgatc ttgagaacta tgccgacata ataggaaatc gctggataaa gccgctgagg 8967 

aagctgagtg gcgctatttc tttagaagtg aacgttgacg atatcaactc ccctatccat 9027 

tgctcaccga atggtacagg tcggggaccc gaagttccga ctgtcggcct gatgcatccc 9087 

cggctgatcg accccagatc tggggctgag aaagcccagt aaggaaacaa ctgtaggttc 9147 

gagtcgcgag atcccccgga accaaaggaa gtaggttaaa cccgctccga tcaggccgag 9207 

ccacgccagg ccgagaacat tggttcctgt aggcatcggg attggcggat caaacactaa 9267 

agctactgga acgagcagaa gtcctccggc cgccagttgc caggcggtaa aggtgagcag 9327 

aggcacggga ggttgccact tgcgggtcag cacggttccg aacgccatgg aaaccgcccc 9387 

cgccaggccc gctgcgacgc cgacaggatc tagcgctgcg tttggtgtca acaccaacag 9447 

cgccacgccc gcagttccgc aaatagcccc caggaccgcc atcaatcgta tcgggctacc 9507 
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tagcagagcg gcagagatga acacgaccat cagcggctgc acagcgccta ccgtcgccgc 9567 
gaccccgccc ggcaggcggt agaccgaaat aaacaacaag ctccagaata gcgaaatatt 9627 
aagtgcgccg aggatgaaga tgcgcatcca ccagattccc gttggaatct gtcggacgat 9687 
catcacgagc aataaacccg ccggcaacgc ccgcagcagc ataccggcga cccctcggcc 9747 
tcgctgttcg ggctccacga aaacgccgga cagatgcgcc. ttgtgagcgt ccttggggcc 9807 
gtcctcctgt ttgaagaccg acagcccaat gatctcgccg tcgatgtagg cgccgaatgc 98 67 
cacggcatct cgcaaccgtt cagcgaacgc ctccatgggc tttttctcct cgtgctcgta 9927 
aacggacccg aacatctctg gagctttctt cagggccgac aatcggatct cgcggaaatc 9987 

ctgcacgtcg gccgctccaa gccgtcgaat ctgagcctta atcacaattg tcaattttaa 10047 

tcctctgttt atcggcagtt cgtagagcgc gccgtgcgtc ccgagcgata ctgagcgaag 10107 

caagtgcgtc gagcagtgcc cgcttgttcc tgaaatgcca gtaaagcgct ggctgctgaa 10167 

cccccagccg gaactgaccc cacaaggccc tagcgtttgc aatgcaccag gtcatcattg 10227 

acccaggcgt gttccaccag gccgctgcct cgcaactctt cgcaggcttc gccgacctgc 10287 

tcgcgccact tcttcacgcg ggtggaatcc gatccgcaca tgaggcggaa ggtttccagc 10347 

ttgagcgggt acggctcccg gtgcgagctg aaatagtcga acatccgtcg ggccgtcggc 10407 

gacagcttgc ggtacttctc ccatatgaat ttcgtgtagt ggtcgccagc aaacagcacg 10467 

acgatttcct cgtcgatcag gacctggcaa cgggacgttt tcttgccacg gtccaggacg 10527 

cggaagcggt. gcagcagcga caccgattcc aggtgcccaa cgcggtcgga cgtgaagccc 10587 

atcgccgtcg cctgtaggcg cgacaggcat tcctcggcct tcgtgtaata ccggccattg 10647 

atcgaccagc ccaggtcctg gcaaagctcg tagaacgtga aggtgatcgg ctcgccgata 10707 

ggggtgcgct tcgcgtactc caacacctgc tgccacacca gttcgtcatc gtcggcccgc 107 67 

agctcgacgc cggtgtaggt gatcttcacg tccttgttga cgtggaaaat gaccttgttt 10827 

tgcagcgcct cgcgcgggat tttcttgttg cgcgtggtga acagggcaga gcgggccgtg 10887 

tcgtttggca tcgctcgcat cgtgtccggc cacggcgcaa tatcgaacaa ggaaagctgc 10947 

atttccttga tctgctgctt cgtgtgtttc agcaacgcgg cctgcttggc ctcgctgacc 11007 

tgttttgcca ggtcctcgcc ggcggttttt cgcttcttgg tcgtcatagt tcctcgcgtg 11067 

tcgatggtca tcgacttcgc caaacctgcc gcctcctgtt cgagacgacg cgaacgctcc 11127 

acggcggccg atggcgcggg cagggcaggg ggagccagtt gcacgctgtc gcgctcgatc 11187 

ttggccgtag cttgctggac catcgagccg acggactgga aggtttcgcg gggcgcacgc 11247 

atgacggtgc ggcttgcgat ggtttcggca tcctcggcgg aaaaccccgc gtcgatcagt 11307 

tcttgcctgt atgccttccg gtcaaacgtc cgattcattc accctccttg cgggattgcc 11367 

ccgactcacg ccggggcaat gtgcccttat tcctgatttg acccgcctgg tgccttggtg 11427 

tccagataat ccaccttatc ggcaatgaag tcggtcccgt agaccgtctg gccgtccttc 11487 

tcgtacttgg tattccgaat cttgccctgc acgaatacca gcgacccctt gcccaaatac 11547 

ttgccgtggg cctcggcctg agagccaaaa cacttgatgc ggaagaagtc ggtgcgctcc 11607 

tgcttgtcgc cggcatcgtt gcgccacatc taggtactaa aacaattcat ccagtaaaat 11667 

ataatatttt attttctccc aatcaggctt gatccccagt aagtcaaaaa atagctcgac 11727 

atactgttct tccccgatat cctccctgat cgaccggacg cagaaggcaa tgtcatacca 11787 

cttgtccgcc ctgccgcttc tcccaagatc aataaagcca cttactttgc catctttcac 11847 

aaagatgttg ctgtctccca ggtcgccgtg ggaaaagaca agttcctctt cgggcttttc 11907 

cgtctttaaa aaatcataca gctcgcgcgg atctttaaat ggagtgtctt cttcccagtt 11967 

ttcgcaatcc acatcggcca gatcgttatt cagtaagtaa tccaattcgg ctaagcggct 12027 

gtctaagcta ttcgtatagg gacaatccga tatgtcgatg gagtgaaaga gcctgatgca 12087 

ctccgcatac agctcgataa tcttttcagg gctttgttca tcttcatact cttccgagca 12147 

aaggacgcca tcggcctcac tcatgagcag attgctccag ccatcatgcc gttcaaagtg 12207 

caggaccttt ggaacaggca gctttccttc cagccatagc atcatgtcct tttcccgttc 12267 

cacatcatag gtggtccctt tataccggct gtccgtcatt tttaaatata ggttttcatt 12327 

ttctcccacc agcttatata ccttagcagg agacattcct tccgtatctt ttacgcagcg 12387 

gtatttttcg atcagttttt tcaattccgg tgatattctc attttagcca tttattattt 12447 

ccttcctctt ttctacagta tttaaagata ccccaagaag ctaattataa caagacgaac 12507 

tccaattcac tgttccttgc attctaaaac cttaaatacc agaaaacagc tttttcaaag 12567 

ttgttttcaa agttggcgta taacatagta tcgacggagc cgattttgaa accacaatta 12627 

tgggtgatgc tgccaactta ctgatttagt gtatgatggt gtttttgagg tgctccagtg 12687 

gcttctgtgt ctatcagctg tccctcctgt tcagctactg acggggtggt gcgtaacggc 12747 

aaaagcaccg ccggacatca gcgctatctc tgctctcact gccgtaaaac atggcaactg 12807 

cagttcactt acaccgcttc tcaacccggt acgcaccaga aaatcattga tatggccatg 12867 

aatggcgttg gatgccgggc aacagcccgc attatgggcg ttggcctcaa cacgatttta 12927 
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cgtcacttaa 
tcatcgtctg 
gctggctgtt 
aacgcactat 
tatggatgac 
tcagcaagcg 
cacggctggg 
tcgggcatta 
agaatttaca 
tatgctttgc 
aatccttaca 
atatattcaa 
aaatatggcg 
ataatcttac 
cggcacccgc 
agtccgtgaa 
cactgcccgg 
cgggcgtcgc 
ggactgccag 
ggggggatgg 
ttcggcgtgc 
attggtttaa 
ttgcaaatgc 
catctgtcag 
gcgcccctca 
gggaaactcg 
gccggccgaa 
caagtgtcaa 
caacgccggc 
gggccataga 
gcgctcggtc 
cttgatggag 
gctaaaaaag 
cgtcctgctt 
ccgtgcgcgg 
cattgatgcg 
cctggccgac 
caatcgctct 
ttggcttggt 
agcctcgcag 
aggaacaccc 
tacaccaagg 
atggatatac 
gccacgcttc 
ggagagcgca 
tttcgccacc 
tggaaaaacg 
cacatgttct 
tgagctgata 
gcggaagagc 
gggcagggca 
ggaggggatg 
ctgatcaatc 
ccaatccatc 
cgaaggcgtc 
cgcgctcgcc 
agctgattgt 



aaaactcagg 
cgcggaaatg 
ttacgcgtat 
ggcgacgctg 
ggatggctgg 
atatacgcag 
acggaagtcg 
tctgaacata 
agctataagg 
ccattctata 
tacggcgata 
ggcaatctgc 
cttcatagag 
ctatcacctc 
gaccactatg 
tgccccgacg 
cacctggtcg 
gctcgggctg 
cgctgccatt 
gaggcccgcg 
gcggtcacgc 
aagcaggtta 
tggattttct 
cactctgccc 
agtgtcaata 
cgtaaaatca 
atcgagcctg 
cgtccgcccc 
ggccgcggtg 
cggccgccag 
ttgccttgct 
cgcatgggga 
tcatggctct 
ctcttcgatc 
gtcgtcggtg 
ggccagctcg 
ggccagcagg 
tcgttcgtct 
ttcatcagcc 
agcaggattc 
gctcgcgggt 
aaagtctaca 
cgaaaaaatc 
ccgaagggag 
cgagggagct 
tctgacttga 
ccagcaacgc 
ttcctgcgtt 
ccgctcgccg 
gccagaaggc 
tgaaaaagcc 
ttgtctacat 
gtcacccttt 
gacaatcacc 
tatcgcggcc 
ggcatcgctg 
catcagcgca 



ccgcagtcgg 
gacgaacagt 
gacagtctcc 
gggcgtctta 
ccgctgtatg 
cgaattgagc 
ctgtcgttct 
aaacactatc 
ttattgtcct 
gatatattga 
tcttctatat 
ctcctcatcc 
taattctgta 
aaatggttcg 
ccaagaatgc 
gccgaagtga 
ctgaatgtcg 
atcgcccatc 
tttggggtga 
ttagcgggcc 
gcacagggcg 
aaagacaggt 
gcctgtggac 
ctcaagtgtc 
ccgcagggca 
ggcgttttcg 
cccctcatct 
tcatctgtca 
tctcgcacac 
cccagcggcg 
cgtcggtgat 
cgtgcttggc 
gccctcgggc 
ttcgccagca 
agccagagtt 
cggacgtgct 
taggccgaca 
ggaaggcagt 
atccgcttgc 
ccgttgagca 
gggcctactt 
cgaacccttt 
gctataatga 
aaaggcggac 
tccaggggga 
gcgtcgattt 
ggccttttta 
atcccctgat 
cagccgaacg 
cgccagagag 
cgtagcgggc 
ggctctgctg 
ctcggtcctt 
gcgagtccct 
cgcaacagcg 
tcgccggcct 
ttgacggcgt 



taacctcgcg 

ggggctatgt 

ggaagacggt 

tgagcctgct 

aatcccgcct 

ggcataacct 

caaaatcggt 

aataagttgg 

gggtttcaag 

taagcgcgct 

aaaagatata 

tcttcatcct 

aaggtccaat 

ctgggtttat 

ccaaggtaaa 

agggcaggcc 

atgccagcac 

ccgttactgc 

ggccgttcgc 

gggagggttc 

cagccctggt 

tagcggtggc 

agcccctcaa 

aaggatcgcg 

cttatcccca 

ccgatttgcg 

gtcaacgccg 

gtgagggcca 

ggcttcgacg 

agggcaacca 

gtacttcacc 

aatcacgcgc 

ggaccacgcc 

gggcgaggat 

tcagcaggcc 

catagtccac 

ggctcatgcc 

acaccttgat 

cctcatctgt 

ccgccaggtg 

cacctatcct 

ggcaaaatcc 

ccccgaagca 

aggtatccgg 

aacgcctggt 

ttgtgatgct 

cggttcctgg 

tctgtggata 

accgagcgca 

gccgagcgcg 

tgctacgggc 

tagtgagtgg 

caacgttcct 

gctcgaacgc 

gcgagagcgg 

gctcctcaag 

ccccggccga 



catacagccg 
cggggctaaa 
tgttgcgcac 
gtcacccttt 
gaagggaaag 
gaatctgagg 
ggagctgcat 
agtcattacc 
cattagtcca 
gcctatgcct 
ttatcttatc 
cttcgtcttg 
tctcgttttc 
cgcacccccg 
aattgccggc 
gccacccagg 
ctgcggcacg 
cccgatcccg 
ggccgagggg 
gagaaggggg 
taaaaacaag 
cgaaaaacgg 
atgtcaatag 
cccctcatct 
ggcttgtcca 
aggctggcca 
cgccgggtga 
agttttccgc 
gcgtttctgg 
gcccggtgag 
agctccgcga 
accccccggc 
catcatgacc 
cgtggcatca 
gcccaggcgg 
gacgcccgtg 
ggccgccgcc 
aggtgggctg 
tacgccggcg 
cgaataaggg 
gcccggctga 
tgtatatcgt 
gggttatgca 
taagcggcag 
atctttatag 
cgtcaggggg 
ccttttgctg 
accgtattac 
gcgagtcagt 
gccgtgaggc 
gtctgacgcg 
gttgcgctcc 
gacaacgagc 
tgcgtccgga 
agcctgttca 
cacggcccca 
aaaacccgcc 



ggcagtgacg 
tcgcgccagc 
gtattcggtg 
gacgtggtga 
ctgcacgtaa 
cagcacctgg 
gacaaagtca 
caattatgat 
tgcaagtttt 
tgccccctga 
agtattgtca 
gtagcttttt 
atacctcggt 
aacacgagca 
cccgccatga 
ccgccgccct 
tcaatgcttc 
gcaatggcaa 
cgcagcccct 
ggcacccccc 
gtttataaat 
gcggaaaccc 
gtgcgcccct 
gtcagtagtc 
catcatctgt 
gctccacgtc 
gtcggcccct 
gaggtatcca 
cgcgtttgca 
cgtcgcaaag 
agtcgctctt 
cgttttagcg 
ttgccaagct 
ccgaaccgcg 
cccaggtcgc 
attttgtagc 
gccttttcct 
cccttcctgg 
gtagccggcc 
acagtgaaga 
cgccgttgga 
gcgaaaaagg 
gcggaaaagc 
ggtcggaaca 
tcctgtcggg 
gcggagccta 
gccttttgct 
cgcctttgag 
gagcgaggaa 
ttggacgcta 
gtggaaaggg 
ggcagcggtc 
ctccttttcg 
ccggcttcgt 
acggtgccgc 
acagtgaagt 
tcgcagagga 



12987 
13047 
13107 
13167 
13227 
13287 
13347 
13407 
13467 
13527 
13587 
13647 
13707 
13767 
13827 
13887 
13947 
14007 
14067 
14127 
14187 
14247 
14307 
14 367 
14427 
14487 
14547 
14607 
14667 
14727 
14787 
14847 
14907 
14967 
15027 
15087 
15147 
15207 
15267 
15327 
15387 
15447 
15507 
15567 
15627 
15687 
15747 
15807 
15867 
15927 
15987 
16047 
16107 
16167 
16227 
16287 
16347 
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agcgaagctg cgcgtcggcc gtttccatct gcggtgcgcc cggtcgcgtg ccggcatgga 16407 

tgcgcgcgcc atcgcggtag gcgagcagcg cctgcctgaa gctgcgggca ttcccgatca 164 67 

gaaatgagcg ccagtcgtcg tcggctctcg gcaccgaatg cgtatgattc tccgccagca 16527 

tggcttcggc cagtgcgtcg agcagcgccc gcttgttcct gaagtgccag taaagcgccg 16587 

gctgctgaac ccccaaccgt tccgccagtt tgcgtgtcgt cagaccgtct acgccgacct 16647 

cgttcaacag gtccagggcg gcacggatca ctgtattcgg ctgcaacttt gtcatgcttg 16707 

acactttatc actgataaac ataatatgtc caccaactta tcagtgataa agaatccgcg 16767 

cgttcaatcg gaccagcgga ggctggtccg gaggccagac gtgaaaccca acatacccct 16827 

gatcgtaatt ctgagcactg tcgcgctcga cgctgtcggc atcggcctga ttatgccggt 16887 

gctgccgggc ctcctgcgcg atctggttca ctcgaacgac gtcaccgccc actatggcat 16947 

tctgctggcg ctgtatgcgt tggtgcaatt tgcctgcgca cctgtgctgg gcgcgctgtc 17007 

ggatcgtttc gggcggcggc caatcttgct cgtctcgctg gccggcgcca gate 17061 
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<212> PRT 

<213> Phaeodactylum tricornutum, Physcomitrella patens, Caenorhabditis elegans 
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X ^ ^ 










Asn 


Leu 


oiy 


iiia 


Tyr 


rne 


Q /-I V" 

oer 


Asn 


i nr 


lie 


rne 


X xe 




rixg 


iyr 




130 










135 










140 










Asn 


Arg 


Glu 


Arg 


Ala 


Met 


Ala 


Ser 


Val 


Asp 


Tyr 


Cys 


Ala 


Ser 


Glu 


Met 


145 










150 










155 










160 


Lys 


Asn 


Arg 


Asn 


Leu 


Lys 


Leu 


Trp 


Val 


Phe 


Pro 


Glu 


Gly 


Thr 


Arg 


Asn 










165 










170 










175 




Arg 


Glu 


Gly 


Gly 


Phe 


He 


Pro 


Phe 


Lys 


Lys 


Gly 


Ala 


Phe 


Asn 


He 


Ala 








180 










185 










190 






Val 


Arg 


Ala 


Gin 


He 


Pro 


He 


He 


Pro 


Val 


Val 


Phe 


Ser 


Asp 


Tyr 


Arg 






195 










200 










205 








Asp 


Phe 


Tyr 


Ser 


Lys 


Pro 


Gly 


Arg 


Tyr 


Phe 


Lys 


Asn 


Asp 


Gly 


Glu 


Val 




210 










215 










220 










Val 


He 


Arg 


Val 


Leu 


Asp 


Ala 


He 


Pro 


Thr 


Lys 


Gly 


Leu 


Thr 


Leu 


Asp 


225 










230. 










235 










240 


Asp 


Val 


Ser 


Glu 


Leu 


Ser 


Asp 


Met 


Cys 


Arg 


Asp 


Val 


Met 


Leu 


Ala 


Ala 










245 










250 










255 




Tyr 


Lys 


Glu 


Val 


Thr 


Leu 


Glu 


Ala 


Gin 


Gin 


Arg 


Asn 


Ala 


Thr 


Arg 


Arg 








260 










265 










270 






Gly 


Glu 


Thr 


Lys 


Asp 


Gly 


Lys 


Lys 


Ser 


Glu 















275 280 



<210> 74 

<211> 477 

<212> PRT 

<213> Phaeodactylum tricornutum, Physcomitrella patens, Caenorhabditis elegans 

<400> 74 



Met Gly Lys Gly Gly Asp Ala Arg Ala Ser Lys Gly Ser Thr Ala Ala 



114 



1 








5 










10 










15 




Arg 


Lvs 


He 


Ser 


Trp 


Gin 


Glu 


Val 


Lvs 


Thr 


His 


Ala 


Ser 


Pro 


Glu 


Asp 








20 










25 










30 






Ala 


Trp 


He 


He 


His 


Ser 


Asn 


Lvs 


Val 


Tvr 


Asp 


Val 


Ser 


Asn 


Trp 


His 






35 










40 










45 








Glu 


His 


Pro 


Gly 


Gly 


Ala 


Val 


He 


Phe 


Thr 


His 


Ala 


Gly Asp Asp 


Met 




50 










55 










60 










Thr 


Asp 


He 


Phe 


Ala 


Ala 


Phe 


His 


Ala 


Pro 


Gly Ser 


Gin 


Ser 


Leu 


Met 


65 










70 










75 










80 


Lys 


Lys 


Phe 


Tvr 


He 


Glv 

J. jr 


Glu 


Leu 


Leu 


Pro 


Glu 


Thr 


Thr 


Gly 


Lvs 


Glu 










85 










90 










95 




Pro 


Gin 


Gin 


lie 


Ala 


Phe 


Glu 


Lvs 


Glv 

NJ J. Jf 


Tvr 


Arg Asp 


Leu 


Arq 


Ser 


Lvs 

jjy ^ 








100 










105 










110 






Leu 


He 


Met 


Met 


Gly 


Met 


Phe 


Lvs 


Ser 


Asn 


Jjy3 




Phe 


Tvr 


Val 


Tvr 

J: 






115 










120 










125 








Lys 


Cys 


Leu 


Ser 


Asn 


Met 


Ala 


He 


Trp 


Ala 




AT a 

/A_L Ct 


Cvs 


Ala 


Leu 


Val 




130 










135 










JL \J 










Phe 


Tyr 


Ser 


Asp 


Arg 


Phe 


Trp 


Val 


His 


Leu 


Alp 


C cj y- 


Ala 


Val 


Met 


Leu 


145 










150 










J. ^ <J 










160 


Gly Thr 


Phe 


Phe 


Gin 


Gin 


Ser 


Gly 


Tro 


Leu 


Ala 


n J~ o 


Asp 


Phe 


Leu 


His 










165 










170 










175 




Hi c: 
rij. o 




Val 


Phe 


Thr 


Lvs 


Arg 


Lvs 


His 


Glv 






Gly 


Gly 


Leu 


Phe 








180 










185 










190 






J. J.]-; 


Gly 


Asn 


Leu 


Met 


Gin 


Glv 


Tvr 


Ser 


Val 


\3 J. 11 


1 rp 


Trn 


Lys 


Asn 


Lys 






195 










200 










205 








Hi c; 


Asn 


Glv 


His 


His 


Ala 


Val 


Pro 


Asn 


Leu 


His 


Cys 


Ser 


Ser 


Ala 


Val 




210 










215 










220 










Ala 


Gin 


Asp 


Gly 


Asp 


Pro 


Asp 


He 


Asp 


Thr 


Met 


Pro 


Leu 


Leu 


Ala 


Trn 


225 










230 










235 










240 


Ser 


Val 


Gin 


Gin 


Ala 


Gin 


Ser 


Tvr 


Arg 


Glu 


Leu 


Gin 


Ala 


Asp 


Gly 


Lys 










245 










250 










255 




Asp 


Ser 


Glv 


Leu 


Val 


Lys 


Phe 


Met 


He 


Arg 


Asn 


Gin 


Ser 


Tvr 


Phe 


Tvr 








260 










265 










270 






Phe 


Pro 


He 


Leu 


Leu 


Leu 


Ala 


Arg 


Leu 


Ser 


Trp 


Leu 


Asn 


Glu 


Ser 


Phe 






275 










280 










285 








Lys 


Cys 


Ala 


Phe 


Glv 


Leu 


Glv 


Ala 


Ala 


Ser 


Glu 


Asn 


Ala 


Ala 


Leu 


Glu 




290 










295 










300 










Leu 


Lys 


Ala 


Lvs 


Glv 


Leu 


Gin 


Tvr 


Pro 


Leu 


Leu 


Glu 


Lvs 


Ala 


Gly 


He 


305 










310 










315 










320 


Leu 


Leu 


His 


Tvr 


Ala 


Trp 


Met 


Leu 


Thr 


Val 


Ser 


Ser 


Glv 


Phe 


Glv 


Arg 










325 










330 










335 




Phe 


Ser 


Phe 


Ala 


Tvr 


Thr 


Ala 


Phe 


Tvr 


Phe 


Leu 


Thr 


Ala 


Thr 


Ala 


Ser 








340 










345 










350 






Cys 


Gly 


Phe 


Leu 


Leu 


Ala 


He 


Val 


Phe 


Gly 


Leu Gly 


His 


Asn 


Gly 


Met 






355 










360 










365 








Ala 


Thr 


Tvr 


Asn 


Ala 


Asp 


Ala 


Arg 


Pro 


Asp 


Phe 


Trp 


Lys 


Leu 


Gin 


Val 




370 










375 










380 










Thr 


Thr 


Thr 


Arq 


Asn 


Val 


Thr 


Gly 


Gly 


His 


Gly 


Phe 


Pro 


Gin 


Ala 


Phe 


385 










390 










395 










400 


Val 


Asp 


Trp 


Phe 


Cvs 


Glv 


Glv 


Leu 


Gin 


Tvr 


Gin 


Val 


Asp 


His 


His 


Leu 










405 










410 










415 




Phe 


Pro 


Ser 


Leu 


Pro 


Arg 


His 


Asn 


Leu 


Ala 


Lys 


Thr 


His 


Ala 


Leu 


Val 








420 










425 










430 






Glu 


Ser 


Phe 


Cys 


Lys 


Glu 


Trp 


Gly 


Val 


Gin 


Tyr 


His 


Glu Ala Asp 


Leu 






435 










440 










445 








Val 


Asp 


Gly 


Thr 


Met 


Glu 


Val 


Leu 


His 


His 


Leu Gly 


Ser 


Val 


Ala 


Gly 




450 










455 










460 
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Glu Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala Met 
465 470 475 

<210> 75 

<211> 47 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1)..(47) 

<223> USPl upstream primer 

<400> 75 

ccggaattcg gcgcgccgag ctcctcgagc aaatttacac attgcca 

<210> 76 

<211> 47 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1) . . (47) 

<223> USP2 upstream 

<400> 76 

ccggaattcg gcgcgccgag ctcctcgagc aaatttacac attgcca 

<210> 77 

<211> 47 

<212> DNA 

<213> unknown 



<220> 

<221> misc_feature 

<222> (1) . . (47) 

<223> USP3 upstream 

<400> 77 

ccggaattcg gcgcgccgag ctcctcgagc aaatttacac attgcca 

<210> 78 

<211> 48 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 
<222> (1) . . (48) 
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<223> USPl downstream 

<400> 78 

aaaactgcag gcggccgccc accgcggtgg gctggctatg aagaaatt 

<210> 79 

<211> 27 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1)..(27) 

<223> USP2 downstream 

<400> 79 

cgcggatccg ctggctatga agaaatt 



<210> 80 

<211> 45 

<212> DNA 

<213> unknown 



<220> 

<221> niisc_f eature 

<222> (I).. (45) 

<223> USP3 downstream 

<400> 80 

tcccccggga tcgatgccgg cagatctgct ggctatgaag aaatt 



<210> 81 

<211> 40 

<212> DNA 

<213> unknown 



<220> 

<221> mi sc_f eature 

<222> {1)..(40) 

<223> OCSl upstream 

<400> 81 

aaaactgcag tctagaaggc ctcctgcttt ■ aatgagatat 



<210> 82 

<211> 51 

<212> DNA 

<213> unknown 



117 



<220> 

<221> misc_f eature 

<222> (1)..(51) 

<223> 0CS2 upstream 

<400> 82 

cgcggatccg atatcgggcc cgctagcgtt aaccctgctt taatgagata t 



<210> 83 

<211> 33 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1)..(33)' 

<223> 0CS3 upstream 

<400> 83 

tcccccgggc catggcctgc tttaatgaga tat 

<210> 84 

<211> 53 

<212> DNA 

<213> unknown 



<220> 

<221> misc_feature 

<222> (1) . . (53) 

<223> OCSl downstream 

<400> 84 

cccaagcttg gcgcgccgag ctcgaattcg tcgacggaca atcagtaaat tga 

<210> 85 

<211> 53 

<212> DNA 

<213> unknown 



<220> 

<221> mi sc_f eature 

<222> (1)..(53) 

<223> 0CS2 downstream 

<400> 85 

cccaagcttg gcgcgccgag ctcgaattcg tcgacggaca atcagtaaat tga 

<210> 86 

<211> 47 

<212> DNA 
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<213> unknown 



<220> 

<221> misc_feature 

<222> (1) . . (47) 

<223> 0CS3 downstream 



<400> 86 

cccaagcttg gcgcgccgag ctcgtcgacg gacaatcagt aaattga 47 



<210> 87 
<211> 29 
<212> DNA 



<213> unknown 



<220> 

<221> misc_feature 

<222> (1)..(29) 

<223> 5 T06E8.1f 



<400> 87 

acataatgga gaacttctgg tcgatcgtc 29 



<210> 88 
<211> 24 
<212> DNA 



<213> unknown 



<220> 

<221> misc_feature 

<222> (1) . . (24) 

<223> 3 T06E8.1r 



<400> 88 

ttactcagat ttcttcccgt cttt 24 



<210> 89 
<211> 26 
<212> DNA 



<213> unknown 



<220> 

<221> misc_f eature 

<222> {1)..(26) 

<223> 5 F59F4.4f 



<400> 89 

acataatgac cttcctagcc atatta 



26 



<210> 90 

<211> 24 

<212> DNA 

<213> unknown 



<220> 

<221> misc_feature 

<222> (1) . . (24) 

<223> 3 F59F4.4r 

<400> 90 

tcagatattc aaattggcgg cttc 

<210> 91 

<211> 32 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1) . . (32) 

<223> ARe503f 

<400> 91 

ttaagcgcgg ccgcatggag aacttctggt eg 

<210> 92 

<211> 31 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1)..(31) 

<223> ARe504r 

<400> 92 

acctcggcgg ccgccctttt actcagattt c 

<210> 93 

<211> 41 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1)..{41) 

<223> Forward primer 



<400> 



93 
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acataatgga gaacttctgg 



<210> 94 

<211> 41 

<212> DNA 

<213> unknown 



<220> 

<221> misc^f eature 

<222> (1) . . (41) 

<223> Reverse primer 

<400> 94 

ctagctagct tactcagatt 



<210> 95 

<211> 18 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1)..(18) 

<223> primer 

<400> 95 

caggaaacag ctatgacc 

<210> 96 

<211> 19 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1)..(19) 

<223> primer 

<400> 96 

ctaaagggaa caaaagctg 

<210> 97 

<211> 18 

<212> DNA 

<213> unknown 



<220> 

<221> misc feature 



tctattgttg tgttttttct 



tcttcccgtc ttttgtttct 



<222> (1) . . (18) 
<223> primer 



<400> 97 

tgtaaaacga cggccagt 

<210> 98 

<211> 18 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1) . . (18) 

<223> LPAAT069-5 

<400> 98 

gctacattgc catggagc 



<210> 99 

<211> 19 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1) . . (19) 

<223> LPAAT069-3 

<400> 99 

gctacaagag gtcaggtcg 

<210> 100 

<211> 25 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1)..(25) 

<223> ACtrau-5 

<400> 100 

ctggatccat gagcgcgtgg acgag 



<210> 101 

<211> 25 

<212> DNA 

<213> unknown 



<220> 

<221> misc^f eature 

<222> (1)..{25) 

<223> ACtrau-3 

<400> 101 

ttggatccca agaggtcagg tcgga 



<210> 102 

<211> 26 

<212> DNA 

<213> unknown 



<220> 

<221> inisc_f eature 

<222> (1)..(26) 

<223> ACtrau-3 stop 

<400> 102 

ttggatccct acaagaggtc aggtcg 

<210> 103 

<211> 22 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> {1)..{22) 

<223> YES-HIS-5 

<400> 103 

ctgagctcat gagcgcgtgg ag 

<210> 104 

<211> 35 

<212> DNA 

<213> unknown 



<220> 

<221> misc_feature 

<222> (1) . . (35) 

<223> YES-HIS-3 

<400> 104 

atggatccgt gatggtgatg gtgatgcaag aggtc 



<210> 105 
<211> 18 
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<212> DNA 
<213> unknown 



<220> 

<221> misc^f eature 

<222> (1)..(18) 

<223> LPAAT069-5 

<400> 105 
gctacattgc catggagc 



<210> 106 

<211> 19 

<212> DNA 

<213> unknown 



<220> 

<221> misc_feature 

<222> {1)..{19) 

<223> LPAAT069-3 

<400> 106 

gctacaagag gtcaggtcg 



<210> 107 

<211> 41 

<212> DNA 

<213> unknown 

<220> 

<223> primer 



<220> 

<221> misc_f eature 

<222> {1)..(41) 

<223> MaLPAATl.l 

<400> 107 

atggatgaat ccaccacgac catcagcccg atgcttgctg 



<210> 108 

<211> 40 

<212> DNA 

<213> unknown 



<220> 

<221> mi sc_f eature 

<222> (1) . . (40) 

<223> MaLPAATl.2 
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<400> 108 

atgaacccta tctacaaggg ttcagcccga tgcttgctgc 



<210> 109 

<211> 37 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (I).. (37) 

<223> ShLPAAT 

<400> 109 

atgttactgc tagcatttgt ttactttgcc attaagg 



<210> 110 

<211> 46 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1)..(46) 

<223> T6 

<400> 110 

atgagcgcgt ggacgagggc ctacaagagg tcaggtcgga cgtaca 

<210> 111 

<211> 40 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> {1)..(40) 

<223> Pp00406404 

<400> 111 

atggctttga tgtatatctg ttacacgatt tttcttttag 



<210> 112 

<211> 41 

<212> DNA 

<213> unknown 



<220> 

<221> raise feature 
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<222> (1)..{41) 
<223> Pp02006422 

<400> 112 

atgctgatat tacagccctt cctaatgaac aggaagaccg 



<210> 113 

<211> 39 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> {1)..(39) 

<223> Pp01505214 

<400> 113 

atgatccgga ttttcagagt cagtccgttt tgccgaggt 



<210> 114 

<211> 39 

<212> DNA 

<213> unknown 



<220> 

<221> misc_feature 

<222> (1) . . (39) 

<223> Pp00403422 

<400> 114 

atgccgtcgc tgtttcgggt caatcagttc gcctgcttc 



<210> 115 

<211> 41 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1)..{41) 

<223> Pp00410427 

<400> 115 

atgctgatat tacagccctt cctaatgaac aggaagaccg 



<210> 116 

<211> 39 

<212> DNA 

<213> unknown 
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<220> 

<221> misc^f eature 

<222> (1)..{39) 

<223> Pp02001815 

<400> 116 

atgaccagca cggaaaatac ctagatgtta gtttcactc 



<210> 117 

<211> 40 

<212> DNA 

<213> unknown 



<220> 

<221> misc_feature 

<222> (1)..(40) 

<223> Pp01503434 

<400> 117 

atgattatga tggaggtgct gtcagtccgt tttgccgagg 



<210> 118 

<211> 40 

<212> DNA 

<213> unknown 



<220> 

<221> misc_feature 

<222> (1)..(40) 

<223> Pp01503336 

<400> 118 

atgtgttcaa tttcttgtgg ttagtggaac ataagctgtt 



<210> 119 

<211> 41 

<212> DNA 

<213> unknown 



<220> 

<221> misc_f eature 

<222> (1) . . (41) 

<223> Fg003028298 

<400> 119 

atgggaaagt ccactttacc tatgaagtct cctcatcatc 



<210> 120 
<211> 27 
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<212> DNA 
<213> unknown 



<220> 

<221> misc_f eature 

<222> (1) . . (27) 

<223> GSP 

<400> 120 

tctctttttc gtgctgctcc agccgat 27 



<210> 121 

<211> 52 

<212> DNA 

<213> unknown 

<220> 

<223> primer 



<220> 

<221> misc_f eature 

<222> (1)..(52) 

<223> 5 primer attlThLPAAT 

<400> 121 

ggggacaagt ttgtacaaaa aagcaggctc catgagcgcg tggacgaggg cc 52 

<210> 122 

<211> 72 

<212> DNA 

<213> unknown 



<220> 

<221> mi sc_f eature 

<222> (1)..(72) 

<223> 3 primer att2ThLPAAT 

<400> 122 

ggggaccact ttgtacaaga aagctgggtc tagtggtggt ggtggtggtg caagaggtca 60 



ggtcggacgt ac 



72 



